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PREFACE 


In so far as quantum mechanics is correct, chemical questions are 
problems in applied mathematics. In spite of this, chemistry, because 
of its complexity, will not cease to be in large measure an experimental 
science, even as for the last three hundred years the laws governing the 
motions of celestial bodies have been understood without eliminating 
the need for direct observation. No chemist, however, can afford to 
be uninformed of a theory which systematizes all of chemistry-even 
though mathematical complexity often puts exact numerical results 
beyond his immediate reach. In this book we have attempted to put 
into a systematic, condensed form the tools which have been found 
useful in efforts to understand and develop the concepts of chemistry 
and physics. 

We have included a wider range of subject matter than is to be found 
in other introductory textbooks in quantum mechanics and have indi- 
cated this in the title. The effort has been to build a reasonably com- 
plete and unified logical system on which the serious student can con- 
tinue building. This has necessitated a somewhat formal mathematical 
style. Much interesting illustrative material might have been added. 
Including it would have lengthened a book already long enough. 
Instead we have tried to present group theory, statistical mechanics, 
and rate theory in a usable, if condensed, form. 

The book has been written at the level of the graduate student in- 
chemistry. It contains somewhat more material than has been pre- 
sented in the year course given at Princeton since 1931. The good 
student who has mastered calculus will be able to follow the arguments. 
The way will be made easier by whatever he has learned of differential 
equations, vector analysis, group theory, and physical optics. Often 
unfamiliarity is mistaken for inherent difficulty. The unavoidable for- 
mality of quantum mechanics looks much worse on first reading than 
it is. The important fact emerges from the experience of the last 
ftventy years that mastery of the subject is worth what it costs in 
sffort. 

$Our general indebtedness to others is great and is acknowledge in 
'art throughout the text and in Appendix IX. We want, in addition, 
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CHAPTER I 


INTRODUCTION: THE OLD QUANTUM THEORY 

la. The Composition of Matter. It is now well established that 
all matter is composed of a small number of kinds of particles. We 
cannot discuss here the long chain of experimental evidence on which 
our belief in the existence of these particles is based, but at present the 
following types are known : 

Electrons , with a mass of 9.107 X 10“ 28 gram and a negative charge 
of 4.8025 X 10“ 10 electrostatic unit. 

Protons) with a mass of 1.6725 X 10~ 24 gram and a positive charge 
equal numerically to that of the electron. 

Neutrons , with a mass very nearly equal to that of the proton but 
carrying no electric charge. 

In addition, the following particles have been observed in nuclear 
phenomena but are not of chemical importance: 

Positrons , with the mass of the electron and the charge of the proton. 

Mesotrons or mesons, with the charge of the electron but with a mass 
intermediate between that of the electron and that of the proton. 

Neutrinos , with a mass of the same order of magnitude as that of the 
electron but without electric charge. 

The evidence for the last two particles is less conclusive than that 
for the first four. 

From the viewpoint of the chemist, matter may be regarded as being 
composed of atomic nuclei, of charge +Ze and mass M, where Z is the 
atomic number and e the magnitude of the electronic charge, and elec- 
trons of charge — e and mass m. If matter is composed only of these 
particles, then it must follow that all the properties of matter are 
properties of assemblies of these particles. A knowledge of the forces 
between these particles and their laws of motion is at least in principle 
sufficient to determine the behavior of all matter under any conditions. 

It was only natural that the first attempts in this direction should 
have been made with the laws of classical mechanics as laid down by 
Newton. It was soon found, however, that these laws were inadequate 
to cope with problems of atomic mechanics. 

lb. Black-Body Radiation. The failure of classical mechanics in 
this realm was first recognized clearly by Planck 1 in 1901. By applying 

1 Planck, Ann. Physik , 4, 553 (1901). 
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classical mechanics to the problem of the equilibrium between a perfect 
absorber and emitter of radiation, a so-called black body, and its radia- 
tion field, Rayleigh and Jeans had found that the amount of energy 
per unit volume, E v dv ) in the frequency range between v and v + dv, 
in equilibrium with a black body at the absolute temperature T should 
be 


E v dv — 


8tt v 2 kT , 
o — dv 


1*1 


where k is the Boltzmann constant and c is the velocity of light. This 
law is in hopeless disagreement with the experimental facts, since it 
states that E v becomes infinite as v approaches infinity, whereas experi- 
mentally E v approaches zero for large values of v. 

Planck overcame this difficulty only by making a violent departure 
from the concepts of classical mechanics. In order to simplify the 
problem he assumed that the black body was composed of harmonic 
oscillators, that is, of small charged particles bound by Hooke's law 
forces to their equilibrium positions. According to classical theory 
such oscillators will absorb or emit radiation of their natural frequency, 
the absorption or emission taking place continuously. According to 
Planck's “quantum theory," however, this absorption or emission is 
not continuous. Instead, the energy of each oscillator must always 
be an integral multiple of a certain “ quantum " of energy, the size 
of the quantum depending on the natural frequency v 0 of the oscillator 
according to the law 

e = hv o 1*2 


where e is the quantum of energy and h is a factor of proportionality, 
the now famous Planck's constant, the value of which is 6.6242 X HT -27 
erg * seconds. Application of the same statistical methods as those used 
by Rayleigh and Jeans now led Planck to the black-body radiation 
formula 


E v dv 


Sir hv 3 


1 

hv 

ekr- 


— dv 

1 


1*3 


which is in agreement with experiment. It will be noted that, when the 
condition hv *C kT is satisfied, the two expressions for the radiation 
density are identical. 

lc. The Photoelectric Effect. The next successful application of 
Planck's quantum hypothesis was made by Einstein. 2 It was known 
that, when light falls on a clean metal surface, electrons are emitted. 

2 A. Einstein, Ann . Physik, 17, 132 (1905). 
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If the kinetic energy of the electrons is plotted against the frequency 
of the incident light, we obtain a graph of the type shown in Figure 1*1. 
Varying the intensity of the incident light at constant frequency does 
not affect the kinetic energy of the emitted electrons but merely changes 
the number which are emitted in unit time. The equation of the curve 
in Figure 11 is 

Kinetic energy = h(v — v 0 ) 1*4 

where r 0 is a minimum frequency below which no electrons are emitted 
and h is numerically identical with the value required by Planck to 
make equation 1*3 reproduce the experimental data on the density of 
radiation in equilibrium with a black body. This photoelectric effect 
was completely explained by Einstein by the hypothesis that the energy 


Kinetic 

Energy 


v o v —*~ 

Fig. 11. Energy of photoelectric electrons. 

of light is not spread out through the wave, as classical electrodynamics 
would have it, but is concentrated into corpuscles, or photons, of energy 
hv. It is further assumed that the emission of an electron from the 
surface takes place only when the electron is struck by and receives 
all the energy of the photon. The kinetic energy with which the elec- 
tron is emitted will be less than hv by an amount hv 0 , the energy with 
which the electron is bound to the surface. The intensity of the light 
therefore determines only the number of electrons emitted; the fre- 
quency of the light determines their energies. 

Id. Bohr’s Theory of the Hydrogen Spectrum. Perhaps the great- 
est of these early successes of the quantum theory was the theory of the 
hydrogen spectrum as developed by Bohr. 3 Empirically, it has been 
found that all the lines in the hydrogen spectrum could be represented 
by the formula 

v = R n(y-—?) 1-5 

where n and m are integers with n < m, and where n has the values 1, 
8 N. Bohr, Phil. Mag., 26, 1 (1913). 
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2, 3, 4, 5 for the Lyman, Balmer, Paschen, Brackett, and Pfund series 
of lines, respectively. The Rydberg constant for hydrogen, R H , has 
the value 109,677.581 if the frequencies are measured in wave- 
numbers P. (P is the reciprocal of the wavelength in centimeters.) 
Bohr postulated, following Rutherford, that a hydrogen atom was 
composed of a positive nucleus around which moved one electron. 
Classically, it can be shown that an electron of mass m moving about 

a proton of mass M is equivalent to a particle of mass g = ■ ni ~ — 

m + M 

moving about a fixed center. Considering only the case in which the 
particle moves in a circular orbit of radius r under the influence of the 
coulomb forces between the two charges, we have, since the centrif- 
ugal and attractive forces must balance, 

e 2 

Mr l — J = - 1-6 


/ <&A 2 e 2 
Xdt) = ? 


where <p is the azimuthal angle. The kinetic energy T y the potentia 
energy V , and the total energy E = T + V are 


T ’** -**(&)' 


V = - 


1-7 


1 e 2 

E = T + V —T 

2 r 

Bohr now made the assumption that only those orbits are allowed 

for which the angular momentum is an integral multiple of ; that is, 

2 7T 

only those values of r are allowed for which 

( n an integer) 


^2 dtp nh 

M Jt ~ 2^ 


Using this relation, the allowed values for the energy are 

rc=-'r = _iJ_!^ == _l£! 

2 nr 2 4x 2 2 r 

or, eliminating r from these equations, 


E n = 


2xV 4 


21,2 


nh 


1-8 


1-9 


110 


Classically, such a system would radiate light of a frequency equal to 
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the frequency of revolution of the electron about the nucleus. Bohr 
therefore made the additional assumption that the atom could exist in 
the states characterized by the above energy values without radiating 
energy, and that a transition from the state with the “ quantum 
number ” m to that with the quantum number n could take place with 
emission of light of frequency 



E m ~~ E n 
ch 


Ml 


Substituting the values for the allowed energy levels into this expression 
gives the allowed frequencies 

112 

2 2 e 4 

with R h = — — . We see that (M2) is identical in form with (1-5). 
ch 

The calculated value of R n agrees with the experimental value within 
the limits of error involved in the determination of the values of the 
fundamental constants e, h , and m . 

le. The Old Quantum Theory. The work of Planck, Einstein, and 
Bohr formed the basis of what is now known as the old quantum 
theory. This method of attacking atomic problems consisted of two 
definite parts: first, the problem was solved by the methods of clas- 
sical mechanics; and then, from all the possible motions of the system, 
only those which fulfilled certain “ quantum conditions ” were kept. 
These quantum conditions, for a system of / degrees of freedom, were 
of the form 

£ Vi dq% = nh (i = 1, 2 • • •/) M3 


where is any of the coordinates representing a degree of freedom 
of the system and is the corresponding momentum. The symbol 

£ means that the integral was taken over a complete cycle if the motion 

was periodic; non-periodic motions were not quantized. The anal- 
ogous quantum condition in the above treatment of the hydrogen 
atom is given by equation 1-8, The number n on the right side of 
equation 1*13 was called the quantum number corresponding to that 
particular degree of freedom. In most cases n was taken to be an 
integer, but in some problems it proved better to use half quantum 
numbers, that is, to give n a value from the series 1/2, 3/2, 5/2 * • • . In 
this way a number (generally an infinite number) of quantized motions 
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of the system were found. The energies of these motions were called 
the energy levels of the system. These energies substituted in equa- 
tion 1*11 determined the spectrum of the system. By the absorption 
of light of the proper frequency the system could be raised from a low 
energy level to a higher one, and by the emission of light the system 
could pass from a high energy level, or excited state, to a lower one. 
Although it did not prove possible to calculate the actual energy levels 
of any atom except hydrogen (and other one-electron atoms such as 
He + and Li ++ ), the scheme of energies could be found from the spec- 
trum, and in this way a large amount of knowledge concerning atomic 
structure was gained. 

In spite of the success of the old quantum theory in simple problems, 
it finally became evident that it could not produce correct quantitative 
results in the more complicated ones. For this reason the old quantum 
theory was finally abandoned in favor of what is known as quantum 
mechanics. 

If. The Dual Nature of Light. The essential feature of quantum 
mechanics is the dualism of all the fundamental particles. At times 
these particles behave like forms of wave motion; at other times they 
exhibit the ordinary properties of particles. Consider, for example, 
the photon. Although Einstein was forced to assume that the energy 
of a light wave was concentrated into corpuscles, his success in explain- 
ing the photoelectric effect in no way invalidated the old well-tested 
evidence that light is a form of wave motion. It is just as inconceivable 
that a stream of particles should show the phenomena of diffraction as 
that a wave should suddenly concentrate its energy at one point to 
knock an electron out of a surface. 

Let us examine the conditions under which light behaves as a wave. 
In a typical diffraction experiment, light from a point or line source 
passes through a slit system and the diffraction pattern is recorded on 
a photographic plate. Now it has been found experimentally that the 
sensitization of a photographic plate is a quantum process as much as 
is the photoelectric effect. Regarded from the photon theory, there- 
fore, the experiment may be considered as the passage of a stream of 
photons from the source to the plate. If it were possible to perform 
the experiment with a single photon we could not possibly obtain the 
complete diffraction pattern; at most one grain of the emulsion on the 
plate would be sensitized. The experiment with a large number of 
photons can be regarded as the experiment with a single photon re- 
peated a large number of times. The diffraction pattern is then an 
expression of the probability that a photon emitted from the source 
will strike a given part of the plate. The waves themselves are not 
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observed in this or any other optical experiment: the actual observa- 
tions of the light are always quantized, whether we detect the light with 
a photographic plate, a photoelectric cell, or the human eye. 

Since we always observe photons, and not light waves, we must 
logically conclude that light is “ really ” a stream of photons. The 
waves are the mathematical expressions of the way in which the photons 
move. The photons of a beam of light do not obey Newton’s laws of 
motion but the laws of wave motion. 

1 g. The Dual Nature of Electrons. It is now known, as a result 
of the experiments of Davisson and Germer 4 and of G. P, Thomson , 5 
that diffraction experiments very similar to those on light may be per- 
formed with a beam of electrons. These experiments form a brilliant 
confirmation of the hypothesis, first advanced by de Broglie , 6 and put 
into mathematical expression by Schrodinger , 7 that electrons, instead 
of having laws of motion similar to the classical laws, actually obeyed 
the laws of wave motion in the same way that photons do. 

Instead of starting with the classical motion and applying quantum 
conditions to it, as in the old quantum theory, the new quantum me- 
chanics abandons classical concepts almost entirely. In the new 
quantum mechanics all our information concerning an electron is con- 
tained in the mathematical expression of a function SP. The square 
of the absolute value of this function expresses the probability of 
finding the electron at a given point. The law according to which \F 
changes with time is known, so that a statistical knowledge, but nothing 
more, can be obtained of the position of the electron at any future time. 

In the chapters to come we shall see how this new mechanics gives 
all the results of the old quantum theory and goes on to solve problems 
in which the old theory failed. Before starting a discussion of the 
principles of quantum mechanics and its application to problems of 
chemical interest, it will be of value to review, in the next chapter, the 
principles of classical mechanics, since the new mechanics is expressed 
largely in the terminology of the old. 

4 C. Davisson and L. Germer, Phys. Rev ., 30, 705 (1927). 

5 G. P. Thomson, Proc. Roy. Soc. London , 117, 600 (1928). 

6 L. de Broglie, Ann. phys.> 3, 22 (1925). 

7 E. Schrodinger, Ann . physik , 79, 361, 478; 80, 437; 81, 109 (1926). 



CHAPTER II 


THE PRINCIPLES OF CLASSICAL MECHANICS 

k 

2a. Generalized Coordinates. In classical mechanics, the motion 
of a particle is determined by Newton’s law of motion, which may ; be 
written in the form 


d 2 x 

m i?~ u 


d 2 y , 


d 2 z 


21 


where x ) y , and z are the Cartesian coordinates of the particle, m is its 
mass, and /*, f y , and f z are the three components of the total force 
acting on the particle. 

In many problems it is not convenient to use rectangular coordinates. 
In the problem of planetary motion, for example, the forces are simple 
expressions in polar coordinates but are quite complicated in rectan- 
gular coordinates. Again, if the motion is not free but is subjected to 
constraints, such as the requirement that the particle move on a given 
surface, it is usually preferable to use a coordinate system such that 
the condition of constraint takes the form of a requirement that one 
or more of the coordinates remains constant. 

The most frequently met example of a motion subject to constraints 
is the motion of a rigid body. We may regard a rigid body as a system 
of particles moving in such a way that the distance between any two 
of them remains constant. If there are N particles, their positions 
are specified by 3A r rectangular coordinates xi, yi, Z\ • • * xn, yw, *n- 
These variables are not all independent, since we must allow them to 
vary only in such ways that the distances between particles remain 
constant. If we fix the coordinates of three particles which do not lie 
in a straight line, the coordinates of all the other particles are then 
determined. Between the nine coordinates of the three chosen par- 
ticles there are three relations corresponding to the three distances 
between the particles, so that the positions of the three particles, and 
hence of the whole body, are fixed by only six variables. The only 
exception to this rule occurs when all the particles lie on a straight 
line; then the position of the whole system is seen to be determined 
by five variables. The number of variables necessary to specify the 

8 
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position of a system is known as the number of degrees of freedom of 
the system; thus, a rigid body has six degrees of freedom. 

In general theposition of a system of N particles with F degrees of 
freedom is determined by F coordinates, which we shall denote by 
qi, ■ ■ ■ q F . The conditions of constraint may then be expressed by the 
constancy of 3 N — F other variables q F+ i • ■ • qw- In terms of the 
q’a the rectangular coordinates of the particles are given by the 3 N 
functions 

= <Pi(Qi • • • Saw) Vi = (Si Saw) = <pz{qi • * * Saw) 


2-2 


*w = <PZN- 2(Sl • ■ • S3w) VN = <PZN — 1 (Si • • • S3 w) Z W — <P 3 v(?l ' * ' S8w) 

2b. "^Lagrange’s Equations. We shall now find the equations of 
motion in terms of the generalized coordinates q { . In the derivations 
which follow, it is convenient to make a slight change of notation. 
Instead of representing the Cartesian coordinates of the first particle 
by xi, 7/i, Z\, we shall use xi, x 2 , x%. For the second particle we use 
x 4 , x 5 , x 6 , and so on. We number the forces and masses in the same 
way. Newton’s laws may then be expressed by the single equation 


d 2 Xi 

mi lF 


= /, 


2-3 


Likewise, equations 2-2 become 

Xi = <Pi(qi • • • g 3 w) 2-4 

and a variation in the a:’s may be expressed in terms of a variation in 
the g’s by the relation 

dXi 

dXi = 'L — dqj 25 

i "Qi 

Suppose that the system is subjected to an infinitesimal displacement 
of the coordinates. In this displacement, the forces do an amount 
of work 
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is the generalized force associated with the generalized coordinate 
qj. Using equation 2-3, the work done may also be written as 

? tit v ^ j't j v' dXi dXi 

dW = 22mi — j dxi = £I>; -rj — dq, 2-8 

{ Cot ^ j dt 

so that we have the relation 

HZm^—dqj = ZQj dqj 2-9 

* 3 vQj y 

Since the q’ s are independent variables there is no relation between 
the dq y s ; equation 2*9 will therefore be true only if the coefficients of 
each dqj on both sides of the equation are equal. This condition gives 
us the set of j equations : 

d 2 Xi dxi 

,.2 ~ “ Qj 2- 10 

i dr dqj 

The left side of this equation can be simplified. By the ordinary 
rule for the differentiation of an implicit function, we have 

^ ^ ydXidqj 
dt j dqj dt 

so that, upon differentiating with respect to —■ = $ k , we obtain 

do 

** = — 2-12 

dqic dq k 

We can obtain the additional relation 

d dxi dii ^ ^ 

dt dq k dq k 


by noting that 


dxi _ j- d dXi dqj 
dqk J dqk dqj dt 


d dXi ^ d dXj dqj 
dt dq k j dqj dq k dt 


We now write the left side of equation 2* 10 as 
_ d 2 Xi dxi „ d ( dxi dxA 


E w 

m i ~T2 T“ 
i dt dq 


i _ v <2 / dxi dxA 
j i m% dt \ dt dqj) 


_ dxi d dxi 

~TT T 4 T~ 
i dt dt dqj 
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which becomes, after using the relations expressed by equations 2*12 
and 2*13, 

_ d x% dx% _ d ( dx% djiA dx{ dxj 

~d? dqj ~ dt \dt dtjj) ~ ^ mi ~dt % 

=I(4;[? im '(f)1)-i[?Kfy] 217 

(dx A 2 

The kinetic energy of the system is T = YL\ m i > ^he se ^ °f 

equations 2- 10 may therefore finally be written 


d dT _ dT _ a 
dt dqj dqj ’ 


218 


If the/,’s are derivable from a potential, that is, if there exists a function 
V(x\ ■ ■ ■ x 3N ) such that 


fi = 


dV 

dXi 


then 


0 = _ v — — = — 

* i dxs da-s den 


219 


2-20 


In this particular case, which is the most important one, equation 
2-18 reduces to 


d dT 
dt dqj 


(T - 7) = 0 

dqj 


2*21 


Since V is a function of the coordinates only, this equation can be 
written more simply in terms of the function L — T — V; it then 
becomes 


» d dL dL 
dt dfc dqj 


2*22 


The function L is Jmown as the Lagrangian function for the system, 
and the set of equations represented by 2*22 as the Lagrangian equa- 
tions of motion. 

Example 1. Two particles of masses mi and ra 2 are connected by 
a massless rod of length R . The system moves in a vertical plane 
under the influence of gravity. Discuss the motion. 

Let the coordinates of the center of gravity of the system be x and 
y , the y axis being vertical, and let the angle made by the line from the 
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center of gravity to the first particle with the vertical be <p. In terms 
of these coordinates, the rectangular coordinates of the two particles are 


, m 2 n ■ , m 2 

xi — x -\ R sin <p yi = y ; R cos <p 


x 2 = x 


mi + m 2 

mi 


mi + m 2 
The kinetic energy 


R sin <p y 2 = y - 


mi + m 2 

mi 

m x + m 2 


2-23 


R cos v> 


is easily found to be, in the new coordinate system, 


2-24 


where M = mi + m 2 and / = - m ' im — R 1 2 , The forces acting on the 

mi + m 2 

particles are, first, the forces due to gravity, which may be derived 
from the potential V = Mgy, where g is the gravitational constant; 
and the forces exerted by the connecting rod. Suppose that the x 
component of this force on the first particle is F . In order that the 
force may be directed along the rod, the y component must be F cot <p. 
The law of action and reaction then requires that the components of 
the force on the second particle be —F and — F cot <p. It is easily 
verified that the generalized forces due to the connecting rod vanish. 
This result is perfectly general: the generalized forces due to the con- 
straints of any system always vanish and hence never need be con- 
sidered. The Lagrangian function for the system is therefore 

1 - t - v ^ - f to 2 + (f ) 2 ] + 1 (f y - "* 225 

and Lagrange's equations of motion become 
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The first two of these equations show that the center of gravity of the 
system moves as a particle of mass M under the influence of the grav- 
itational field. The last equation shows that the angular velocity of 
the system is constant. 

A Example 2. A particle of mass m moves in a plane under the in- 
fluence of a potential which is a function only of the distance of the 
particle from a fixed point in the plane. Discuss the motion. 

Let the origin of coordinates be at the fixed point; let r be the dis- 
tance of the particle from the fixed point, and let be the angle between 
the x axis and the line from the fixed point to the particle. The rela- 
tion between the two coordinate systems is therefore 

x — r cos <p y = r sin <p 2*27 

so that the kinetic energy is 

If we represent the potential energy by V (r), the Lagrangian function is 

and the Lagrangian equations of motion are 



The first of these may be integrated to give the result 

mr 2 ~ = c = p<p 2*31 

at 


that is, the angular momentum of the particle about the fixed point is 

dtp • 

a constant. By use of this result — may be eliminated from the second 

at 

equation, giving 


d 2 r 

m -< ¥~ 


™* + 7r V{r) 


2-32 


which may be solved when an explicit form is taken for V(r). The 
second term in this equation represents the centrifugal force. 
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2c. Generalized Momenta and Hamilton’s Equations. In many 
problems it is convenient to express the kinetic energy in terms of 
momenta instead of velocities. For generalized coordinates we define 
the momentum p,- associated with the coordinate as 


Pi = 


dL 

dii 


2-33 


In order to find the laws of motion in terms of the p.’s and qjs instead 
of the qi s and q.’s, consider the differential 


dL 



dqi + dqi 
dqi 


2-34 


The coefficient of dq, is by definition p,-, while from Lagrange’s equations 


dL d dL 
dq t di dqi 


2-35 


Hence 

dL — Y (Vidqi + Pidqi) 2-36 

i 

If we subtract equation 2-36 from the identity 

d(Yj> t qi) = YiPidqi + 4,-dpi) 2-37 


we obtain 


dCLvrfi - L) = Y{q4Pi - pidqi) 

i i 


2-38 


The quantity (YPili — L) is known as the Hamiltonian of the system 

t 

and is represented by the symbol H. If H is regarded as a function 
of the p’s and q’ s, then, from 2*38, 

dH = X! (Qidpi — &%) 2*39 

i 

and by the definition of a partial derivative 




2 #° 


This set of equations is known as H amilton’s eq uati ons o f motion. 

Since the kinetic energy T is a^omogeneouTqua^r^Tc 'ftiffction of 
the £’s, and the z’s are homogeneous linear functions of the <fs, it 
follows that I 7 is a homogeneous quadratic function of the <fs; that is, 
T = where a ij may be a function of the q’s but not of the 
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#’s. If the forces are derivable from a potential, then 
Vk 


6L dT _ 

A A *“ a a ~ “ 1 “ jL, a ik$i 

oqic dq k j i 


UPkik = T.YLO'kjMi + H'L^ikMk = 2T 

k k j i k 


2-41 


H = E» — L = 2T — L = 2T — (T — V) = T + V 

k 


th^Jp is, the Hamiltonian function II is equal to the total energy of the 
system. 

Example 3. Solve example 1 by the use of Hamilton’s equations. 
The generalized momenta are : 


* 


Vt 


dL dx 

— = M — 
dx dt, 


dL „ 

Vv ~ - . — M 
dy 


(iy 

dt 


2-42 


so that 


V V 


dL ^ d(p 

dip dt 


H = 2M + p2y) + 2 l p2 + Mgy 


2-43 


and Hamilton’s equations of motion become 

dx ^Vx dp x = 

dt M dt 

dy ^Vv dPy __ 

dt M dt 

&P dp £= Q 

d/ / d£ 


2-44 


The equations are readily seen to be equivalent to those obtained from 
Lagrange’s equation. 

Any pi and the corresponding qi are said to be conjugate variables. 
By expending this definition to include any pair of variables which 
satisfy equations of the form 2*39 the concept becomes more general. 
For example, if a system of F degrees of freedom is described by the 
coordinates q\ • * • q F and the time t, with t being formally treated on 
the same basis as the q’ s, the analysis shows that for systems in which 
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the total energy i» uunstant the variable conjugate to t is the negative 
of the Hamiltonian function. 

^2d. Vibration Theory and Normal Coordinates. Many problems 
of mechanics are concerned with the vibrations of a system of particles. 
By a vibration we mean the oscillations of a system when it is slightly 
disturbed from a position of stable equilibrium. In such a motion no 
coordinate ever departs by a large amount from the value it would have 
if the system were in the equilibrium position. It is convenient in 
these problems to choose a system of coordinates such that all the g/s 
vanish at the point of equilibrium; then all the qj s will remain small 
throughout the motion. 

If we were to express the coordinates of each particle by a set of 
rectangular coordinates with the origin at the equilibrium position of 
the particle, the kinetic energy of the system would be 

r - y 245 


Usually a more general coordinate system would be used; the kinetic 
energy of the system in generalized coordinates is of the form 


T = 


i w do* dqj 

22 r / dt dt 


246 


where the a 4 */ s are functions of the q/s, but for small vibrations it will 
be a sufficiently good approximation to regard the a;/s as constants, 
with the value they have at the equilibrium position. 

The potential energy may be expanded in a Taylor’s series in the 
q 1 s about the point of equilibrium: 


v = Fo + Z (“) q* + EE\(~~) Mi 

i \dqi/ o i j 2 \dqi dqj / o 


+ 


247 


where the derivatives are evaluated at qi = 0, the position of equi- 
librium. The constant term 7 0 is arbitrary, and so for the sake of sim- 
plicity we take it to be zero. Since qi = 0 is a point of equilibrium, 


V must be a minimum at this point, so that 



= 0 . 


If we denote 


the constant 



by bij, we may therefore represent V approxi- 


mately by 


V — 2 


248 
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Lagrange’s equations for the system are then 

,2 

Ea, v J + ZM, = 0 249 

If the system has F degrees of freedom, there are F of these differential 
equations corresponding to i = 1, 2, • * • F. In order to solve these 
equations let us try to find a set of constants c* such that, if each equa- 
tion of the set is multiplied by c,* and the results added together, the 
new equations will be of the form 

^ + xg-o 2-50 

where Q is an expression of the form 

Q = JlhjQj 2*51 

j 

The equations which must be satisfied in order to obtain this result are 
^ :=: 7" 'y „ Cjbjj ^ hj 2*52 

* A i 

The equations given by the equality sign on the left are just sufficient 
to determine the c/s; the remaining equations will then give the A/s. 
If we write the left-hand equation in the form 

£ (Xo*y — bij)ci = 0 2*53 

t 

it is seen that one solution is c» = 0. Now by an algebraic theorem, 
(Appendix IV) this is the only solution unless the determinant of the 
coefficients vanishes, that is, unless 

|\a# - bij\ = 0 2*54 

This equation may be satisfied if we choose X properly. Let X x be one 
of the F roots of this equation. Then the equations 

Z(Xi«»y - bij)ci = 0 2*55 

i 

have a set of non-vanishing solutions for the c t ’s, which are unique 
except for one arbitrary constant factor. 

When this set of c/s has been determined, the /i/s are fixed by equa- 
tion 2*52. Let these be denoted by hj 1} and the corresponding Q by 
In the same way each of the other roots of 2*54 gives a set of A/s which 
in turn determines a possible Q, so that we arrive at a set of F Q% each 
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of which satisfies the equation 

^r + M2, = 0 (*-1,2, -..JO ' 2-56 


If we regard the Q’s as a new set of coordinates, these equations are just 
Lagrange’s equations in the new coordinates. Because of the simple 
form of these equations the Q’s are known as the normal coordinates of 
the system. In terms of the normal coordinates, the kinetic and po- 
tential energies have the simple form: 

T - V = |X><Q? 2-57 

a 2 F 

dQ i 

positive. But for positive X’s it is easily seen that the solution of 
equation 2-56 is 

Qi = A{ cos (V\it + €»•) 2*58 


j are real and 


If the equilibrium is stable, then all the X,’s I X 


,’s 


where Ai and €, are arbitrary constants. 

If the solution is desired in terms of the original coordinates, the g*’s 
must be expressed in terms of the Qy s by solving simultaneously the 
equations defining the Qi s. Suppose that the result is 

Qi = HgijQj 2*59 

3 

Then the equations of motion are 

Qi = HgijAj cos (V\jt + ej) 2*60 


If all the Ay s are zero except one, then each qi varies sinusoidally with 
time, each with the same phase. Such a motion is called a normal 
mode of vibration of the system. Corresponding to such a mode of vi- 

, ~~ - - — — - Vv 

bration there is a definite frequency given by vj = — - . The most 

2i7T 


general vibration of the system can be regarded as a superposition of 
the various normal modes, with arbitrary amplitudes and phases. 

If the equilibrium is unstable, the above formal treatment can still 
be carried out. In this case, however, at least one of the X’s is real and 
negative, so that the corresponding frequency is imaginary. The 
motion then is not sinusoidal, since the Q corresponding to the negative 
X will be an exponentially increasing function of the time. 

Example 4. Three springs, whose force constants are k, 2Jc, and 
k , a|5p joined in a straight line, and the ends of the^ystem are fixed. 

.W 1 "! K 
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The joints between the springs are balls of mass m (Figure 2*1). Deter- 
mine the motion if the balls are set vibrating in the line of the springs. 



Fig. 21 . 


Let the displacement of the first ball from its equilibrium position 
be X\ and that of the second ball x 2 . The changes in length of the three 
springs are then xi, x 2 — Xi, and — x 2 , so that the potential energy of 
the system is 

V = \kx i + \{2k)(x 2 — xi) 2 + \k(—x 2 ) 2 

= ffcr i — 2kxix 2 + %kx 2 2*61 

If we consider the springs to have zero mass, the kinetic energy is 


■[(fHf) 1 ] 


Lagrange's equations of motion are therefore 
d 2 x i 

m + Skxi — 2 kx 2 = 0 


m -^ 2 — %kxi + 3 kx 2 = 0 

If we multiply the first of these equations by C\ and the second by c 2 
and add, we obtain 

d 2 x i d 2 x 2 

mci ~7T + mc 2 "TT + (3&Ci 2kc 2 )xi + (~2kci + Skc 2 )x 2 = 0 2*64 

dt dl 


In order that this be of the form 2*50 we must have 


raci = ~ (3kci — 2 kc 2 ) = hi 
X 


( — 2fcci + 3kc 2 ) = h 2 


There will be a non-trivial solution for c\ and c 2 only if 
\m — 3 k 2k 

= 0 

2k \m 3 k 
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The roots of this determinant are Xi = — ; X 2 = . Substituting 

m m 

Xi in equation 2*65 gives C\ = — c 2 . Hence if we take ci, which is arbi- 
trary, to be 1, c 2 must be — 1, and 

Qi = h^Xi + h ( 2 ) x 2 = mx i — mx 2 2*67 

In the same way we find for Q 2 the value 

Q 2 = mx i + mx 2 2*68 


The equations of motion in the new coordinates are 


d 2 Qi 

dt 2 



m 


Q i=o 


d 2 Q 2 

dt 2 


+ -Q 2 =0 

m 


2*69 


The first normal mode is that in which Q 2 — 0, that is, Xi + x 2 == 0. 
In this normal mode the balls move in opposite directions with a f re- 




in the second normal mode x x = x 2 ; the balls move 


in the same direction with a frequency — 

27 r 




CHAPTER III 

THE PRINCIPLES OF QUANTUM MECHANICS 


3a. The Uncertainty Principle. A typical problem in classical 
mechanics involves the finding of the values of the various dynamical 
variables of a system, given the values of the p’s and q’s at one instant. 
In general the motion of a system of / degrees of freedom requires the 
knowledge of 2 f variables at some time in order to be completely speci- 
fied. It is assumed in classical mechanics, that it is possible to deter- 
mine these 2 / variables to any desired degree of accuracy. Within the 
last two decades it has been found that this specification cannot be 
carried out beyond a certain limit. 



Fig. 3-1. 


Suppose that we set out to measure the position and momentum of 
an electron in order to determine its motion. The natural instrument 
to use in the determination of its position is a microscope, illustrated 
diagrammatically in Figure 3-1. The accuracy with which a microscope 
can measure distance along the x direction is limited by the wavelength 

^ * 

of the light used, this limit being — ; — . At first it might seem that 

2 sin € 

this difficulty could be overcome by using light of a very short wave- 
length. However, a new difficulty then arises: the Compton effect. 

hv 

If a photon of energy hv and momentum — strikes an electron at rest, 

c 

the photon after the collision will have an energy hv and momentum 
hv' 

— , while the electron will have a kinetic energy \mv 2 and momentum 
0 

mv, where m is the relativistic mass of the electron and v is its velocity. 

21 
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The motions of the photon and electron are illustrated in Figure 3*2. 
The law of conservation of energy gives the relation 

hv = hv + \mv 2 3*1 



while the law of conservation of momentum gives the relations 

hv hv %/ 

• cos a + 


0 


c 

hv 


sin a — mv sin 0 


3-2 

3*3 


The x component of the momentum of the electron is therefore 

Px = “ O' ~ v cos oY 
c 


3*4 


As may be seen from equation 3*1, v is less than v ; that is, the scattered 
light is of longer wavelength than the incident light. For our purposes, 
however, we will obtain a sufficiently accurate value for the momentum 
of the electron if we put v = v in equation 3*4, giving 

h 

Px - r (1 — cos a) 3*5 

A 

If We are to see the light in the microscope it must be scattered by the 
electron into the objective, so that a must lie between the limits 90° — e 
Iwd 90° + e. Since there is no way of telling through which part of the 
objective the light from the electron has passed, we know only that the 
x component of the momentum of the electron lies between the limits 

~ (1 — sin e) < p x < ^ (1 + sin e) 3’6 

A A 


so that there is an uncertainty in the momentum of the electron of an 
amount 


A p x 


Ih . 
~ - sin 

A 


€ 


3-7 
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Owing to the finite resolving power of the microscope, there will be an 
uncertainty in the position of the electron, as mentioned above, of an 
amount 

Ax ~ 3*8 

2 sm € 


The product of these uncertainties is 

A p x Ax ~ h ^ 


3*9 


which is independent of the wavelength of the light used. The attempt 
to gain accuracy in position by using light of short wavelength is there- 
fore defeated by the loss of accuracy in the measurement of momentum. 

This difficulty is quite general. Whatever experiments are devised 
to measure at the same time two conjugate variables, the limit of accu- 
racy always appears to be given by a relation similar to 3*9. This result 
has been assumed by Heisenberg 1 to be a fundamental law of nature, 
and is generally known as the uncertainty principle. 

3b. Wave Mechanics. In order to explain this principle a new 
mechanics is necessary. This mechanics must not, like the old me- 
chanics, assign a definite position and momentum to each particle, but 
must allow an uncertainty in these variables. This is accomplished 
by introducing functions which express, not the fact that a particle is 
at a given point, but the probability that the particle is at that point. 
Such functions are used in the theory of electromagnetic waves. As 
stated in Chapter I, light is corpuscular in nature, at least when it inter- 
acts with matter. The motion of these light corpuscles, or photons, 
is governed by the electromagnetic field, which according to Maxwell's 
equations moves in the form of waves which obey the usual equation of 
wave motion 

d 2 W , d 2 W , d 2 W 1 d 2 W 

+ ~~2 + TT = — ~ 3-10 

dy 


dx* 


dz Z 


d F 


where c is the velocity of light and W is the amplitude of the wave. 
The photon is not definitely located in any part of the wave, but the 
probability of finding a photonat any point is given by the square of the 
amplitude at that point. The energy E of the photon is connected with 
the frequency v of the wave motion by the Einstein relation 

E = hv % 3 11 

and the momentum of the photon is 


p = - 

1 W. Heisenberg, Z. Physik, 48, 172 (1927). 
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A 8 stated in Chapter I, the de Broglie assumption that electrons were 
accompanied by waves controlling their motions in the same manner 
that photons are controlled by electromagnetic waves has been verified 
by the electron diffraction experiments of Davisson and Germer, G. P. 
Thomson, and others. By using crystal lattices as gratings and com- 
paring the wavelength of the electrons as calculated from the diffraction 
pattern with that calculated from the momentum of the electron, the 
validity of equation 3*12 for electron waves has been verified. 

If we assume that the Einstein relation (3-11) is valid for electron 
waves, the velocity of the waves is given by 

, E 

V = \v = — 

V 

The differential equation of the wave motion is then 

d d d 2 V 1 d 2 ^ _ J) 2 d 2 * 

dx 2 + dy 2 + dz 2 ” v 2 dt 2 ~ E 2 dt 2 


313 
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where SF is the amplitude of the wave; the square of this amplitude 
gives the probability of finding the electron at a given point. If we 
wish a solution that will represent standing waves (such as those on a 
string fastened at both ends), we may write ^ in the form 

^ 3*15 


where ^ is a function of x, y, and 2, but not of the time t. For the 
probability of finding the electron at a point we must take the square 
of the absolute value of in order that the probability be real and 
positive. Substituting 3-15 into 3*14 gives 


d 2 \p d 2 \// d 2 \[/ 4tir 2 p 2 

dx 2 + d^ + 'd? = “ "IF"* 


316 


as the differential equation for \p. The kinetic energy of the electron 
is T = E — 7, where V is the potential energy and is connected with 

v 2 

the momentum by the relation T = — . We may therefore write 

2m 


equation 3*16 in the form 


d 2 \f/ d 2 \j/ d 2 \j/ 87 r 2 m 

dx 2 dy 2 dz 2 h 2 


(E — V)$ = 0 


3*17 


This is the first of Schrodinger’s equations, 2 by means of which most of 
the applications of quantum mechanics are made. From the form of 


2 See Chapter I, reference 7. 
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equation 3- 15, it may be noted that this equation may also be written 
as 


d 2 y d 2 -$ d 2 y 

*? + v + 5? + 


87r 2 m / h d_ 
h 2 \27rt dt 


V s ) * = 0 


318 


which is the second of Schrddinger’s equations, Although the greater 
part of this text will be devoted to the solution of equations 3-17 and 
3*18 for particular systems, the above “ derivation ” of these equations 
does not furnish a very satisfactory foundation for an exposition of 
quantum mechanics. We shall therefore proceed to formulate the 
principles of quantum mechanics in more general terms. 

3c, Functions and Operators. Because the mathematics used in 
the general formulation of quantum mechanics is rather unusual, we 
shall first develop the elements of the theory of operators. A function 
is nothing but a rule by which, given any number, we can find another 
number. Thus the function x 2 is merely the rule: take the number 
x and multiply it by itself. Similarly, we define an operator as a rule 
by which, given any function, we can find another function. Thus we 
may define an operator § as follows: multiply the function by the inde- 
pendent variable. This rule is written symbolically 


| fix) = xf(x) 3-19 

Another operator, 8 , is differentiation with respect to the independent 
variable: 

8 /(*) = /'(*) 3-20 


We may develop an algebra of operators, just as we can develop an 
algebra of numbers. The sum of two operators a and p is defined by 
the equation 

(a + p)/(x) = a f(x) + $}(x) 3-21 

The product of two operators is defined by the equation 

ap f(x) = a[p/(:r)] 3-22 

The resemblance of operator algebra to ordinary algebra is only super- 
ficial. Although the operator a + p is the same as the operator p + a 
by definition, the operators ap and pa may be quite different. If 
ap and pa are the same, a and p are said to commute. An example of 
two operators which do not commute is given by the operators | and 
8 defined above, since 

§ 8 /( 3 ) = |[ 8 /(*)] = !/'(*) = xf (x) 

8 | fix) = 8 [| /(*)] = 8 [xf(x)] = xf'(x) +/(*) 


3-23 

3-24 
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Operators are not limited to functions of one variable. We may have 
such operators as S x , defined by the equation 

&*/(*, y ) = z:f( x > y) 3-26 


An important group of operators are the vector operators (Appendix II), 
of which the one most frequently used in quantum mechanics is the 
Laplacian operator, defined by the equation 


v 2 /(w) = y + y + ^ 


3-26 


We define a class of functions as all those functions which obey cer- 
tain specified conditions. Thus there is the class of continuous, single- 
valued functions of one variable x which vanish at x = 1 and x = — 1. 
It may happen that in a given class there are functions which, when 
operated on by an operator a, are merely multiplied by some constant 
a, or in symbols 


a J(x) = af(x) 


3 27 


Those members of the class which obey such a relation are known as 
the characteristic functions or eigenfunctions of the operator a. The 
various possible values of a are called the characteristic values or 
eigenvalues of the operator. For example, if our class is the class of 
functions of x which are finite, continuous, and single-valued in the 

d 

range —tt< x < 7 r , and if our operator is 8 = — , the eigenfunctions 

dx 

are of the form e kx } where k may be real or complex, since 


i ~ 


4~ e kx = ke kx 3-28 

dx 


and every number is an eigenvalue. The class of the functions is very 
important, for, if in this example we had restricted our class by adding 
the condition that the value of the function must be the same at x = x 
and x = —t, the eigenfunctions would have been only those functions 
of the form e tmx f where m is an integer, and the eigenvalues would 
have been the imaginary integers im. 

In quantum mechanics the eigenfunctions which are allowed are 
always chosen from the glass' of functions which are single-valued and 
continuous (except at a finite number of points where the function 
may become infinite) in the complete range of the variables, and which 
give a finite result when the squares of their absolute values are inte- 
grated over the complete range of the variables. If \[/ is such a func- 
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tion and tp* its complex conjugate, the last condition requires that 



dr be finite, where the element of volume dr^dx i dx 2 * * * dx n , the 


x’s being the cartesian coordinates of the particles. By appropriate 
transformations dr is expressible in other coordinate systems. We shall 
refer to this class of functions as the class Q. 

If \p and <p are any two functions belonging to the class Q , and a is an 
operator operating on \p and <p, the operator a is said to be Hermitian if 






3*29 


Hermitian operators have the property that their eigenvalues for func- 
tions in class Q are always real, for, if a is an Hermitian operator and 
^ is an eigenfunction of a from class Q with the eigenvalue a, then 


a\p = a\p 

Taking the complex conjugate of every quantity in this equation, 

aV* = aV 


Then 


and 


J* p/*{(X.\p) dr — a J* \p*\p dr 
J ’<KaV) dr = a* J dr 


3*30 

3-31 

3-32 

3-33 


But if a is Hermitian these quantities must be equal, so that a = a*, 
which is true only if a is real. 

Operators do not generally have the property expressed by the 
equation 

a[cifi(x) + c 2 f 2 (x)] = Cia/i (x) + c 2 af 2 (x) 3*34 

For example, if a is the operator which squares the function, that is, if 

af(x) = [f(x)] 2 

then 

atACr) +/a(*)] = Ifiix)] 2 + 2/j (x)f 2 (x) + [f 2 (x)] 2 3-35 

and 

ofi(x) +a f 2 {x) = [fi(x)] 2 + [f 2 (x)] 2 a[/i (x) + f 2 (x)] 3-36 

If equation 3-34 holds for a given operator, the operator is said to be 
linear. 

3d. The General Formulation of Quantum Mechanics. We are 

now ready to put forward the general principles of quantum mechanics. 
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We shall do this by making certain postulates, which, like the axioms 
of geometry, are not proved. From these postulates the whole theory 
will follow, and finally we shall be led to conclusions which can be 
checked experimentally. The theory will stand or fall on the strength 
of these experimental checks. There is no unique set of postulates in 
quantum mechanics, but the following formulation seems most con- 
venient for our purposes. 

Let us consider a system of particles of / degrees of freedom which 
could be described classically by the values at a given time t of the / 
coordinates q\ * • * q/ and the / conjugate momenta pi * • • p/. We 
then state 

Postulate I. Any state of the system is described as fully as pos- 
sible by a function • • • q h t) of the class Q. This function 
SF(<?i • • • #/, t) is callecTthe state function of the system, and has the 
property that dr is the probability that the variables lie in the 
volume element dr at time t ; that is, q\ has a value between qi and 
qi + dqi, etc. Since each variable must have some value, the total 
probability must be unity, so that 

• -qj, t)V(qi • ■ ■ q/,t) dr = 1 3-37 

where the integral is taken over all possible values of the q’ s. 

Postulate II. To every dynamical variable M there can be 
assigned a linear operator \l. Since we are interested only in observ- 
able quantities, which are real, we may restrict ourselves to the case 
where these linear operators are also Hermitian. The rules for finding 
these operators are the following: 

(a) If M is one of the q 1 s or t , the operator is multiplication by the 
variable itself. 

(b) If M is one of the p’s, the operator is A A, where q . is con _ 

2m dqi 

jugate to 

(<F) If M is any dynamical variable expressible in terms of the q’ s, 
the p's, and t , the operator is found by substituting the operators for 
the g’s, the p's, and t as defined above in the algebraic expression for 
M, and replacing the processes of ordinary algebra by those of operator 
algebra. If there is any ambiguity in the order of the factors, they 
must be arranged so that the resulting operator is Hermitian. 

Postulate III, The state functions ^f(q t t) satisfy the equation 

H (*S5'')* <fc0 "-S5*k , > 


3-38 
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where H(a, u is the Hamiltonian operator for the system. 

\ 27 n dq / 

This is Schrodinger’s equation including the time. 

Postulate IV. If the state function >k( q , t) is an eigenfunction 
of the operator |i corresponding to a dynamical variable Af, that is, if 

t) = m&(q 9 1) 3*39 

then in this state the variable M has the constant value m precisely; 
and conversely. Such a state is known as an eigenstate of M . 

Corollary I. If the state function is an eigenfunction of the 
energy operator, the Hamiltonian operator H, with the eigenvalue E, 
then 'f'iq, t) satisfies the equation 

0 == E*{q, t) 3-40 

If equations 340 and 3*38 are to be consistent, ^{q } t) must be of 
the form 

2*rt „ 

*(?,*) = i(q)e-JT Et 3-41 

where \J/(q) is a solution of the equation 

H (*2 S 3 42 

Equation 3 42 is Schrodinger's equation for a stationary state, that is, 
for a state which is an eigenstate of the energy operator. 

As an example of the manner in which these postulates work let us 
consider the system consisting of a particle of mass m moving in a 
potential field V (x, y , z). The kinetic energy of the particle is 

T & — (yl + p y + p 2 t ) ^ 3*43 

and the classical Hamiltonian function is 

H = ^(pl + pl + rf) + V ( X , y, z) 3-44 

According to the above postulates, the Hamiltonian operator is 

/ h_ d_ f \ = j_ r/_A_ ay /_h_ ay / h_ _ayi 

\ * 2wi dq ) 2m |_\2ri dx) \2?ri dy) \2iri dz) J 

+ V(x, y, z) 



3-45 
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The Schrodinger equation including the time is therefore 

+ 4 + f v 

8 Tr 2 m \dx 2 dy 2 dz 


+ 772 + n ) *( x > y > 2 » 0 + v ( x > y> 2 )*( x > y> 2 > 0 


— h d 

— — ; — 'Wx* ?/, 2, 0 
2W dt K J 

If we look for a solution of this equation in the form 


~2irt 


346 


347 


V(x,y,z,t) = t(x,y,z)e h 
wc obtain the equation 

+ h‘ + i?) + |,(I ' 2) ] y ' ‘ w<1> *■ 348 


This is identical with equation 3*42, so that, if ^(x, y ) z) is a solution 
of equation 3-48 and is a function of class Q, it is an eigenfunction of 
the energy operator and describes a state with the precise value E for 
the energy. Equation 3*48 may also be written as 


dV d 2 \p aV 8 t r 2 m 

dx 2 dy 2 dz 2 K 2 


(E -7)^ = 0 


349 


so that in this special case the general theory gives the same result as 
the theory of section 3b. 

In most of our discussions wc shall be dealing with state functions 
which are eigenfunctions of the energy operator for a given system. 
For such state functions equations 3*37 and 3*39 are equally valid if 
^iq, t) is replaced by yp{q), as may be seen from the relation 3*41 between 
these functions. 

From t{ie postulated form of the operators for position and momen- 
tum we can obtain an important rule for their product. Since 


and 


pq<£ = 


hd h h d\p 

(qnf,) = $ + q — - 

2 tri dq 2 rri 2r%.dq 


q# 



h d\j/ 
2i ri ^ dq 


we have the result that 


4 


(pq - qp)£ = — * 


3-50 

3-51 


3*52 


a rule first discovered by Heisenberg. 1 
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3e. Expans ion Theorems. If two functions <pi{x) and ^(x) have 
the property that 


X 


^ 2 (^)v?i(^) dx = 0 


3*53 


for a certain interval (a, b), the functions are said to be orthogonal in 
this interval, A set of functions 


* * * 


such that any two functions in the set are orthogonal in the interval 
(a, b) is called an orthogonal set for the interval (a, b). If in addition 



3-54 


for all values of i, the set is said to be normalized. 

The great importance of orthogonal functions lies in the possibility 
of expanding arbitrary functions in a series of those orthogonal func- 
tions. Suppose that/(x) is any function and that it is possible to expand 
f(x) in the interval (a, 6) in a series 


f(x) = civ?i(t) + c 2 <p 2 (x) H + Ctfiix) + * • * 3*55 

where the c's are constants. If wc now multiply both sides of equa- 
tion 3-55 by (p* (x), where p n (x) is a member of the set of normalized 
and orthogonal functions given above, we have 

f <P*(x)f(x) dx = 

J a 

f* f) 

Cl J <Pn (x )<fii (*) d£ -f • • • + C„ J <pt(x)<p n (x) dx + • • • 3-56 


All the integrals on the right side of 3-50 vanish because of the orthog- 
onality of the functions except 


hence 



357 


so that the coefficients are easily found if the expansion is valid. The 
question as to when such an expansion is possible is beyond the scope 
of this text, but it is possible for all functions of the class Q. 

We shall now show that the eigenfunctions of any Hermitian operator 
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are orthogonal functions in the interval corresponding to the complete 
range of the variables. Let the operator be a, the eigenfunctions fa 
and ^2# with the eigenvalues a x and a 2 , that is 

Hi = ai^i a ^2 = 82^2 3*58 


Now consider the integral 



Since a is Hermitian, we have 





dr = 0,2 



dr 


3*60 


the last equality arising from the fact that a 2 , being an eigenvalue of 
an Hermitian operator , is real. Hence 


a\ J 'I'tfii dr = 02 yV dr 
(ai — a 2 ) J dr ~ 0 


3*61 


If a x 7* <t2, this requires that 

j* tHl dr = 0 


3*62 


so that eigenfunctions corresponding to different eigenvalues are orthog- 
onal. When two or more eigenfunctions have the same eigenvalues 
this argument breaks down. In this case, however, a set of orthog- 
onal eigenfunctions can always be found. Suppose that 


Hi = aty x \ 


H2 = H2) 



If we replace ^ 2 by ^ 2 = ^ 2 — Hu then 


3*63 



and 


H2 = a (^ 2 ~~ Hi) “ H2 bHi 
= a *- b\j/i) = a^2 


3-65 


so that is also an eigenfunction of a and is orthogonal to fu and if 
we use #8 instead of our eigenfunctions are all orthogonal* The 
process by which $2 was found is known as orthogonalization. 



EXPANSION THEOREMS 


33 


If and ^2 are two eigenfunctions of the operator \l corresponding 
to the dynamical variable M , with eigenvalues mi and m2 (mi ma), 
the state represented by ^ + c 2 ^2 is not an eigenstate of |t, since 

|t^ = }t(ci^i + c 2 i£ 2 ) = Cim^i + c 2 m 2 ^ 2 ^ w(ci^i + c 2 ^ 2 ) 3*66 


In order that 


y V V dr 


1 we must have the relation 


/< 


(W + c*tA*) (ci^i + 02^2) dr = c*ci + c*c 2 = 1 


3-67 


between the coefficients. We may give the following interpretation 
to the state ^ by a postulate, known as the principle of superposition. 

Postulate V. If \pi and \p 2 are eigenfunctions of the operator \l 
corresponding to the variable M with the eigenvalues mi and ?n 2t then 
the state represented by = crfi + c 2 ^ 2 is that state in which the 
probability of observing the value of M to be mi is cfci and the proba- 
bility of observing the value m 2 is c 2 c 2 . Since the set of eigenfunctions 
\f/i, ^ 2 , • • * yj/i, • • • of an operator \l is a normalized, orthogonal set, we 
may expand any state function <p in terms of these \pi s: 


where 


(p = HCtti 

i 


c % = J tfiV dr 

Consider now the integral J* <p*\up dr . Expanding <p in terms of the 
eigenfunctions of \l we have 

f <P*\L*pdr = J 


/ 


= / dT = lifflicjc. 


3-68 


But 23 m * c < c » is merely the average value of Af in the state so that 

< 

we have the important theorem : 

Theorem I. The average value of any dynamical variable M in 
any state <p is given by J <p* |i$> dr, where is the operator corresponding 
to M. 

An important example of an expansion of this type occurs when <p is 
formed by operating on one of the eigenfunctions of an operator |t 
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by the operator a of some other variable, so that = a V7. In this 
special case we have 

<Pj = a \f/j = 3-69 

i 

where 
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The set of quantities a t; found by expanding all the functions a 1/7 is 
called the matrix of a. The quantities are usually written in the 
form of a table: 


«11 

a X2 

* 

. A 

«21 

(l<22 

a 2Z * 


«31 

&32 

<*33 * 



• 

* 

• •/ 


If a is Hermit ian 


a ij 
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Suppose that we also have an operator (3 such that 

= Z&.jtf'.-; bn = j' dr 3-73 

and wc wish to find the matrix r tJ of the operator y = ap. We have 

ap ij = a.Y.bkj'Pk = ZhMk 

k k 

- 'L’Lhjdtk'l'i = T,(T,(hkbkj)ii = LoA 3-74 

k x i k % 


so that 


C%j — llL,Q'ikf ) kj 

k 


3-75 


3f. Eigenfunctions of Commuting Operators. In most of the 
problems in atomic and molecular structure with which we shall be 
concerned we will be interested in several operators at the same time. 
The eigenfunctions of one operator are usually different from those of 
another operator, but there is a very important exception in which one 
set of functions is a set- of eigenfunctions of two operators at the same 
time, namely, when the operators commute. In this case we have the 
theorem : 
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Theorem II. If two operators a and p commute, there exists a set 
of functions which are simultaneously eigenfunctions of both operators. 

Let a and p be two commuting operators and let and be the 
eigenfunctions of these operators, so that 

o-'Pi = a,-*?; Hi = W* 376 

If wo expand tpi in terms of the = Ecjt'Pj wo have 

3 

a \j/i — — aiUcji'l'j 

3 3 

or a - a f )$ = 0 3-77 

i 

The function (a — a,)^ is an eigenfunction of p, or is identically zero, 
far, since a and p commute 

P{ (a - o,)^} = (a - ai)Hi 

= (a - ai)b,tj = hj{ (a - 3-78 

and the eigenvalue of this function is the same as that of ipj itself. Let 
us suppose that there are no two of the 6/ s which are equal, that is, 
the form a set of non-degenerate eigenfunctions. Then it follows 
that \f/j and its multiples are the only functions which satisfy the equation 

/ty = 3-79 

The function (a — must therefore be some multiple of \pj ; that is 

(a - Oi)$ = Qiji'j 3-80 


where gij is some numerical constant. 'Substituting this relation in 
3-77, we have — 0. If we now multiply by \f/l* and integrate 

j 

over the variables we have 


iQij y 


4'k*4'j dr = Ckigik = 0 


3*81 


so that either Cki or g ^ is zero. If g is zero, then 

(a - ai)$ = 0; atf = 3-82 

and \f/j is an eigenfunction of a with the eigenvalue a,*, as well as an 
eigenfunction of p. If Cki = 0, then 

ftiW dr = Ecu flPtf dr = c w = 0 3-83 

and we see that 4\ is orthogonal to Since the set of functions 
form a complete set, no function can be orthogonal to all of them; 
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* 

there must be at least one value of k for which gu = 6. We thus see 
that when there is non-degeneracy every eigenfunction of p is also an 
eigenfunction of a. 

If, as usually happens, there are several v^-’s with the same eigen- 
value, the above argument does not hold, Suppose, for example, that 
and bave the same eigenvalue. Let us then make the following 
change in the ^y’s. We write 

ii = evft + cA 

and construct another function 

^2 = + <^ 2^2 


with the constants d x and d 2 chosen so that $ is orthogonal to \p 2 . 
If there are other degeneracies we adopt an analogous scheme. We now 
carry out the analysis using the ; s. Owing to the degeneracy of 
ypi and \j/ I , we have in place of 3-80 the two equations 

(a — = gn4\ + Qi 2 'P\ 

w f „ . 3*80a 

(a - a<)ti = haft + K&& 


where gn = -- dr 9 etc. It is now possible to form 

linear combinations of \f% and 4$ which are eigenfunctions of a, that 
is, we can find constants kn and /c, 2 such that 


(a - ai){kn^ + k i2 $) = Ai{kn4i + k&l%) 3*84 


By multiplying the first of equations 3* 80a by kn and the second by 
k{ 2 and adding, we find that kn and k& must satisfy the relations 


AJcn — kngn -f- ki 2 hn 

AJcfe ==: k ngi2 * 4 ” ki 2 hi 2 


3*85 


When these relations are satisfied, the functions (ka4i + ki 2 4 2 ) are 
eigenfunctions of a with the eigenvalues A t - + a*. They are of course 
eigenfunctions of p with the eigenvalue b x . 

Theorem II has thus been proved. The analogous theorem holds 
for the case of more than two operators; we shall use this theorem 
without proof. 

We may also prove the converse of this theorem: 

Theorem III. If there exists a complete set of orthogonal functions 

which are eigenfunctions of two operators a and p, then a and p 
commute. 

Let us expand any function <p in terms of the ^’s, and then operate 
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on <p with (ap — pa). We have 

(ap — paV = (ap — pa )!Lc$i = = 0 3*86 


Since <p is an arbitrary function we see that a and p commute. 

The physical interpretation of these theorems is that, if two physical 
quantities have operators which commute (as do the operators for the 
energy and the total angular momentum of a system), then it is pos- 
sible to have states of the system in which both variables have definite 
values. Conversely, if it is possible for two variables (physical quan- 
tities) to have definite values for a complete set of states at the same 
time, then the corresponding operators commute. 

Another theorem which we shall find useful is: 

Theorem IV. If p is an operator which commutes with an operator 
a (where both are Hermitian), and and ^ 2 are eigenfunctions of a, 


then the matrix element 



dr vanishes unless a\ = a 2 , where 


a\ and a 2 are the eigenvalues of \j/\ and 1^2* 

To prove this theorem consider the integral 




dr 


3*87 


Using the fact that a is Hermitian, 

J ^*pa ^2 dr = J $fa(p^ 2 ) dr = J (p^2)(aVi) dr 

= a \J P^2 dr 


3*88 


Therefore 


(ai - a 2 ) 



dr = 0 


3*89 


so t iat 


, dr = 0 if ai 5^ a 2 . 


3g. The Hamiltonian Operator. We have seen that the Hamil 
tonian operator for a single particle is, in rectangular coordinates, 

B -i£(£ + $ + £) +r(x ’> / ’‘ > 


3-90 


but we have not as yet proved that this operator is actually Hermitian. 
Such a proof is necessary, for if we had written the classical Hamil- 
tonian in the form 


H = i (i 1 * + ; * + 7*) + F(x ' »■ *> 


3-91 
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which is algebraically equivalent to equation 344, and- then substituted 
the operators for the p’s, we would have the quite different result 


-h 2 / d 2 Id d 2 Id d 2 1 a\ 

H = ~o — ( ~r ~2 "I — ^ — b t - 5 "I — "T — b ~z ~2 H — ~ ) + V (x, y, z) 3 - 92 
8 ir 2 m \dx 2 x dx dy 2 y dy dz 2 z dz/ 

Since H = H*, the condition that H be Hermitian is 

.///: <p*H\p dx dy dz = SSL dx dy dz 

provided that <p and ^ belong to class Q. This condition requires that 

d 2 

<p and ^ vanish at infinity. Consider the operator — . We have 

ox 2 


If J ’ . ^ S * * * - //_„ <’* to * *1 


SSL 


d<p * '* 

dx dx 


dx dy dz 


Since ip* vanishes at infinity, and — is finite or zero, the first integral on 

dx 

the right is zero, hence 

SSL «■* % * < * * - -///_. £ 2 * * * 

In the same way, we find 

SSL*&**—SSL.ZZ*** 

d 2 d 2 d 2 

so that the operator and analogously the operators — 2 and ^5, are 

Hermitian. The Hamiltonian operator 3 90 , being the sum of Her- 
mitian operators multiplied by real constants, is therefore Hermitian. 
d 

For the operator — , we find 
dx 

U t/ to- 00 dx %) «/to_00 OQ 

-///_.* to'*** 

■ "///_ * to 1 * ' * * " I IS. * £ * * * 
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so that the operator — is not Hermitian. (From the above equations, 

ox 


however, it is apparent that the operator i — is Hermitian.) The 

ox 

1 d 

operator will likewise be non-Hermitian, so that the Hamiltonian 

X 02 / 

written as 3*92 is not valid from the quantum-mechanical viewpoint. 
Written in vector notation, the Hamiltonian operator 3-90 is 


H = V 2 (z, y, Z ) + V(x, y, z) 

07 T m 


3-94 


Rectangular coordinates are not always the most convenient coordi- 
nates to use. Since the Laplacian operator is invariant under a trans- 
formation of coordinates (Appendix III), the Hamiltonian operator 
in an arbitrary coordinate system will be 

H = V 2 (£, ,, f) + Fft, V, r) 3-95 

07 T m 


where £, rj t f are the new coordinates. The transformations from rec- 
tangular to various other important coordinate systems are given in 
Appendix III, as are the expressions for the Laplacian operator V 2 and 
the volume element dr in these coordinate systems. 

Angular Momenta. Of almost as great importance as the 
Hamiltonian operator are the operators connected with angular momenta. 
For a single particle, the angular momentum about the origin is 
M = r X p, where r is the distance from the origin and p is the linear 
momentum, or, in terms of its components M x , M v , and M g in rectan- 
gular coordinates, 

M x = yp z - zp y 

M y = zp x — zp g 3*96 

M z = xp y - yp x 

Replacing the p’s by the corresponding quantum-mechanical operators, 
we obtain the operators for the components of angular momenta: 



3-97 
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The total angular momentum is, of course, 

M = i M x + j M y + k M t 

We shall never have occasion to use M itself, but only its scalar product 
with some other vector, or its square: 

M 2 = M| + M 2 + 3-98 

The angular momentum operators are usually expressed in spherical 
coordinates. By means of the transformations given in Appendix III 
the operators in spherical coordinates are readily found to be: 8 

8 We may illustrate this calculation for M x . From the transformations 
x — r sin 0 cos <p 
y * r sin 0 sin <p 
z — r cos 0 


we obtain the reverse transformations 


r* 
cos 0 


x 2 + y 2 + z 2 

f 

V x 2 + y 2 + z 1 


. y 

tan ip — - 
x 

from which we calculate the partial derivatives: 
dr 

— = cos 0 
dz 


Therefore 


d0 

dz 

djp 

dz 

M* ~ 


sin 0 
r 


dr 

dy 
de 
dy 

dp _ 

dy r sin 0 


sin 0 sin <p 

cos 0 sin ip 
r 

COS ip 


[' 5 — £] 

r . . /dr d , de d . dr> d\ 

Jdr S . »1 ,M\1 

h r . . a 

— . (r sm 0 sm <p cos 0 — r cos 0 sin 0 sin <p) — 

2 in L dr 

d d *1 

+ ( — sin 2 0 sin <p — cos 2 0 sin #>) ~ + ( - cot 6 cos <p) ~~ 

d0 dipj 

h f . 6 dl 

— . I — sm — — cot 0 cos ip r- I 

!irt L 00 a^J 


A 

27rt 
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M, = 
M„ = 
M, = 
M 2 = 


A ( d d\ 

2m \ dd dip) 

h ( d . . d\ 

sr'i-*'-';;) 

h a 

2m d<p 

-A 2 r 1 d/ d\ 1 0 2 1 

4tt 2 Lsin 6 dd V Sm <W/ + sin 2 6 d<p 2 J 


3-99 


The commutation rules for M„ M„, and M* are most readily found 
from the expressions in rectangular coordinates 


M,M„ = 


MyM* = 


47 T 2 

— A 2 

4ir 2 


a a 2 a 2 a 2 a 2 

V b 2/z xy — 5 — z 2 b zz 

y dx y dxdz y dz 2 dx dy 


yZ z‘ 

y dxdz 


dx dy 


_ X2/ _ +a; _ +xz 


dy 


dy dz. 

J! 

dy dz 


3-100 


3101 


so that 




1 

d 

d\ 



dx 

~ x dy) 


ih h 

(x 


3 

II 

2ir 2m 

V 

dy V dx) 


Similarly, we find 



M,M* - M*M, = — M v 

2tt 


3-102 


3-103 


If we now consider the commutation of M 2 and M„ we see immediately 
from the expressions for the operators in spherical coordinates that 

M 2 M, - M.M 2 = 0 3-104 

bo that M 2 commutes with M f . Because of the equivalence of M s , 
M y , and M„ M 2 will also commute with M* and My, as may be proved 
explicitly from the expressions for the operators. These commutation 
rules will be shown later to hold for systems of particles as well as for a 
single particle. - 

From the commutation rules derived above, it is possible to deduce 
the most important properties of angular momenta. Since M 2 and 
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commute, it is possible to find a set of functions which are eigen- 
functions of both operators simultaneously. We shall denote these 
eigenfunctions by Yi , m , where Yi t m satisfies the equations 

m 2 r*, m = fc z r,, m 3*105 

M ,Y lt m = k m Yi , m 3*106 

where the subscripts l and m identify the eigenvalues ki and k m asso- 
ciated with Yi, m . Writing M 2 in terms of its components, 3*105 be- 
comes * 

(M l + M 2 U + M l)Y u M = k{Y u m 3-107 

and applying M* to equation 3-106 yields 

M *Y l , m = klY l , m 3-108 

Subtracting 3*108 from 3*107 gives the result 

(M 2 + M 2 )^, m = (ki - k 2 JY l% m 3*109 

so that Yi t m is an eigenfunction of (M 2 + M 2 ) as well as of M 2 and M f . 

Let us now write Yi ffn = £ crth where the fa ’ s are eigenfunctions of 

* / 

M x with eigenvalues m X i. Then 

M 2 x Y lt m = 'LcimlfPi = EE «»/.,. . 3-110 

i n v 

where 

f Yt m M 2 x Y l , m dr= f (E4^)(Ec,-m^,)dr = EM 2 tf& 3-111 

Jit i 

Similarly, we can find for MjjFj, m the expression 

M 2 v Yi, m = EE&», P Y n , p ; where b„ , p = E 3-112 

n p i 

Comparing 3*109 with the sum of 3*110 and 3*112, we see that 

ki - k 2 m = E(M 2 w&- + \di\ 2 m 2 yi ) 3-113 

i 

Since M x and M y are Hermitian, m 2 xi and m 2 yi are positive; the right 
side of 3*113 is therefore positive, so that 

ki > k 2 m 3*114 

We now make use of the commutation rules for M x , M v , and M f . It 
is easily verified that 

M,(M* + iM„) = (M z + zM^M, + 3.115 

M*(M Z - zM„) = (M x - zM„)^M, - 3.116 
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Operating on Fj, m with both sides of 3115 gives 


M.{ (M, + iM„) Fj, m } = (M, + iM^M, + ~^Y,, „ 

= (k m + £){ (M, + iM„) F,. m } 3-117 


so that (M* + iM„)F/, m is seen to be an eigenfunction of M, with the 
Jl 

eigenvalue k m + — or zero. Since M 2 commutes with all the operators 
2tt 

in 3-115, (Ms + iMy) Yi t m is still an eigenfunction of M 2 with the eigen- 
value hi. In an analogous manner we see that (M* — iM y )F| |W is 

an eigenfunction of M* with the eigenvalue k m — or zero, and of M 2 

2w 

with the eigenvalue fc/. We may therefore obtain a whole series of 
eigenfunctions of M 2 and M*, all having the eigenvalue ki for M 3 but 
having the eigenvalues 


2 h h h 2h 

*} k m ~ , k m — , k my k m -f* t i k m -f* ~~ , 
2t 2tt 2tt 2ir 


for M*. This series must terminate in both directions, since 3*114 
states that the square of the eigenvalue for M* must be less than 
Let us denote by k m t the lowest and by k m n the highest allowed eigen- 
value of M*, and the corresponding eigenfunctions by Yi, m t and Y\ 9 m n. 
(M x + fM y )F*, m n should be an eigenfunction of M* with the eigen- 
h 

value k m tt + — or zero, and since by hypothesis k m n is the highest 
2tt 

eigenvalue we must conclude that 

(M, + tM,)F|, m » = 0 3-119 

For similar reasons, we must have 

(M, - tM,)F|. m / = 0 3-120 

Operating on 3-119 with the operator (M x — iM„) gives 
(M, - tM v ) (M* + iM„) F j, m „ 

= {M^ + M 2 + i(NL x M y - M„M X )} F,. 

= (m2 + M2 - A Y lt = (m 2 - M 2 , - A Y,, m „ 



ki — k 


2 

m" 


h_ 

2v 



Y U 


m" 


= 0 


3-121 
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so that 


kl = fcm" + ^ &m" 


3122 


By operating on 3-120 with (M, -f- iM„), we find 


ki = h 2 m t — — 


3-123 


In order that equations 3-122 and 3-123 will be consistent, with k m " 
> k m >, we must have k m n = —k m r. According to 3-118, k m n must be 

greater than k m > by an integral multiple of ~ so that k m n must be of the 


2x 


nh 


form — , where n is a number in one of the series 0, 1, 2, 3, or 2> f » ' • '• 
Ztt 

Since we have so far put no particular significance on Z, and since k m n 

jh 

depends on l only, we may put l = n, so that k m n = — . Then 

2tt 




by 3*122, or 


ki = ni+i )£- 2 


The possible values of k m are then 


* n - d— 

2w’ 1} 2x’ 


Ih 

2t 


and we may specify m by 


7 mh 7 -v 7 

, l > m > —l 


3*124 


3*125 


3-126 


If l is an integer, m is also an integer; if l has half-integral values, m 
likewise has half-integral values. 

We may now write equations 3T05 and 3T06 as 

M 2 Fj, m = 1(1 + 1) Yl, m 
M t Yi im = m—Yi'm 3-127 

2ir 

As these results were derived entirely from the commutation rules, they 
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will be valid for any operators whose commutation rules are similar to 
those for angular momentum. 

With the aid of the operators (M x ± iM v ) we may obtain the form 
of Yi, m for the case of a single particle where the explicit form of the 
operators is known. We have seen that (M* — iM v )Yi t m is an eigen- 

h? 

function of M 2 with the eigenvalue l (l + 1)— 9 and of M* with the eigen- 

47T 

value (w — 1) . The eigenfunction (M* — iNL v )Yi tm 

Z'K 27 r 27r 

is therefore related to the eigenfunction Fj, m _i by the relation 

(M, - iM u ) Y t , m = NYt, 3-128 

where N is a constant numerical factor. In order to determine N we 
use the requirement that Yt, m be normalized. We therefore multiply 
3-128 by its conjugate and integrate over the coordinates. This gives 

/{( M, - iM v )Fj, m }*(M x - iMy)Fj, m dr 

- NN*f Yt ry -1 Yt, dr = N 2 3-129 

since we may choose our normalizing factor to be real. M* and M y are 
Hermitian, so that the first integral may be written as 

N 2 = f{(M x - iM v )Yi. n }*M x Yi, m dr 

~if{( M,- t'M I/ )F !i m }*MyFj, m dr 


= f Yt, m M* x { ( M x - iMy) Fj, m }* dr 

- if Yi. m My{ (M, - iMy)F,. m }* dr 

= J Yt, 4m: 2 + My* 2 + i(M:My* - My*M:)} Yt m dr 3-130 

Taking the complex conjugate of this equation, and making the obvious 
substitutions, we have 

N 2 = f Yt m (m 2 - M 2 +^M,j Yt, m dr 
= {*(* + 1 ) - rn(m - 1 )} 


3-131 
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(M, - ^ V/(Z + 1) - m(m - 1) F I( m _ 1 

= A V(Z + m )(i - m + 1) Fi. 3-132 

Z7T 

In a similar way, we find 

(M* + iM y )Y t , m = ~ Vz(/ + i) - m ( TO + l) F,. m+1 


- £ V(Z + w + l)(Z-m) Fi, „+! 3-133 


Let us now assume that Fj |W ( 0, <p) may be written as the product 
9j, Using the operator for M* in spherical coordinates, 

we then have 


~ £ {01. m(0)<U*>)} = ^ {©,. .(«)*W(*)} 


or 






The solution of this equation is 

*.(#>) = Ne im ' p 


3-134 


3-135 


3-136 


In order that be a single-valued function of position, we must 

have 


e imV _ 3.137 


= $m(<P + 2ir) or 

This requires e 2 ”"* to be unity, which is true only if m is an integer. 
The normalization condition 

f %*(»>)$»(#>) d<p — N 2 f dip = 1 3-138 

J 0 V 0 


gives the value —7= for N. 

v2ir 

In order to calculate 9j, m (0) we apply 3-119 in the form 
(M, + iM„)F/, , = 0 

'In spherical coordinates the operator (M* + iM„) is 


3-139 


h ( Sa d 


d\ 
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Equation 3*139 is therefore 

A * 

2*- V2ir l\o 


8 <n 

S + 


which reduces to 


-©!,,(«) - loot® Of, |(«) = 0 


d 0/ ile) cos 0 

\ I TT = l—d9 3-143 

0 it i($) sin 0 

Integrating both sides of this equation with respect to 0, we readily find 
the solution 

0/ f i(6) = Nu sin 1 0 3444 

The normalization factor iVn is given by 




sin 21 9 sin 9 do = — sin 2i 0 cos 


® fl]o + 21 f 
J 0 


cos 2 9 sin 21 1 9 do 


— f sin 21 1 

1 »/o 


If the integration is repeated l times, we obtain 

1 21(21 - 2) • • • 2 r* . 

JV2 “ (21 + 1) (21 — 1) • • • 3 Jo 81 
2(2 l (!) a 
~ (21 + 1)! 

The normalized solution of 3-142 is therefore 


sin 6 dB 


©i. iW = (— 1) 


, / (21 + 1)! 1 . , 

* am* i 


where the factor (— 1) J has been introduced for our convenience later. 
From the expression thus obtained for Fj, j, F|, m may be found by 
repeated application of 3-132. The result obtained in this way is 
equivalent to the following compact expression, as will appear in the 
next chapter (section 4e). 

Yi.miO,?) = 

(-Jiy f(» + l) q - |w|)i „ d t+M sin at g ^ 


- sin 1 *" 1 6 73 3-148 

! (dcose) ,+|m| 


where \m\ is the absolute value of m. 



CHAPTER IV 


THE DIFFERENTIAL EQUATIONS OF QUANTUM MECHANICS 1 

4a. The Linear Differential Equation of the Second Order. We 

have seen that the Hamiltonian operator is a second-order differential 
operator, so that the equation H \j/ = E\// is a second-order differential 
equation. In order to find the eigenfunctions of H we must therefore 
develop the technique of solving differential equations of this type. 
For the present we shall confine ourselves to the case of one inde- 
pendent variable. The general equation to be solved is then of the form 

+ Pte) + q(x)y = 0 4-1 


where p(x) and q(x) are given functions of x. 

Suppose that, in the neighborhood of the point x = x 0) y can be 
expanded in the form of a power series 

y = a 0 + ai(x — x 0 ) + a 2 (x - x 0 ) 2 + • • • 4*2 

Taylor's theorem then states that 

_ MA __L/ d V\ a o 

oo-y^-x,), °i a 2 2!wL; 43 

If we solve 4-1 for r-X , we find an expression for in terms of ~~ 


and y . If we differentiate 4-1, we then find an expression for in 
d 2 y dy 

terms of , -j- , and y. By repeated differentiations we may therefore 
dx ax 

find any derivative of y in terms of the lower derivatives. Now suppose 

dy 

that at the point x = x 0 we are given the values of y and — . The 

dx 

differential equation then gives all the higher derivatives, and from 
these the constants oq, ai, a 2 • • * in the series 4*2 are determined by 

dv 

4-3. Thus from the values of y and -j- at one point we may determine 

dx 

1 Some readers may prefer to delay the study of this chapter until they require 
the specific results for the solution of the equations in later chapters of the text. 

48 
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the whole function throughout the interval in which the series 
converges. 

In practice this method usually turns out to be rather clumsy. If, 
however, the differential equation can be put in the form 

dv 

P (x) + Q(x) ^ + R(x)y = 0 44 

where P(x), Q(x), and R{x) are polynomials in x, the solution is much 
simpler. We expand y as a power series in x 

y = a 0 + d\x -[”••• -f- a v x v -f- * * * 4*5 

and differentiate the series term by term, obtaining 

dy , 

— = a\ -f- 2a 2 x 4 + (y ~f~ tyd v +\x v 4~ * * * 4*6* 

ax 

P = 2«2 + 3-203* + • • - + (y + 2)(v + 1) a^x v + • • • 4-7 

ax 


If we substitute these series in the differential equation, the coefficient 
of every power of x must vanish. Putting the coefficient of x v equal 
to zero then gives a relation of the form 


a v = Cia,_i 4~ C2dp ~. 2 4 — * + Ckdp—k 4*8 


where the c’s are constants. The number of c’s occurring in this 
expression will depend on the form of the differential equation. Such 
a relation is known as a recursion formula; by means of it the whole 
series can be found when the first few terms are known. 

As a simple example of this process consider the equation 


A 

dx 2 


y = 0 


4*9 


where P(x) = 1, Q(x) = 0, R(x) = — 1. If we put 

y = a 0 + a ix + d 2 x 2 4 * 4*10 

and substitute in the equation, we find 

0 = (2a 2 — do) 4 - (3*2a 3 — di)x + **• 

+ [(v + 2)(v + l)a v+2 - a v ]x v 4 - • * * 4*11 

Equating the coefficient of x v to zero gives 

. gc dp — 2 

0>+2 + 2) {y + 1) ’ F °” v(v — 1) 


412 
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If we take Oo = Oi = 1, we obtain the particular solution 

a»3 

y = i+* + — + — -)-•••+ — + -** = e* 4-13 

whioh is seen to satisfy the differential equation 4-9. 

If we attempt to use this method on the equation 

x 2 ^ + x^+(x 2 ~i)y = 0 4-14 


we find 

0 = — ^0 + + «o )^ 2 + • • * 

+ [O' 2 i)uv + cl v — 2]%* + • • • 4-15 

so that the recursion formula is 


a v - 2 



4*10 


In order that the constant term and the coefficient of x vanish, we must 
have a© = a% = 0. The recursion formula then requires that all the 
remaining a’s vanish, so that we get no solution at all. A little con- 
sideration shows why this has happened. If we write equation 4*14 
in the form of 4*1, we have 


fy 1 1 dy 
dx 2 x dx 



= 0 


417 


When x = 0, p(x) and q(x) become infinite, so that t- 4 cannot be 

dx 


found from this equation. The point x = 0 is called a singular point 
of the equation. If it is possible to write a differential equation in 
the form 


X 2 xl + xp'(x) — + q'(x)y = 0 


dx 2 


418 


where p'{x) and q' (x) are finite at x — 0, the point x — 0 is called 
a regular point of the equation. A singular point which is not regular 
is called an essential singularity. 

In the neighborhood of a regular point a differential equation can 
usually be solved by the following method. Instead of starting the 
power series with a constant term, we use a series of the form 

y = aoX L + aix L+l -f- a 2 x L+2 + • • • 


419 
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where a 0 0 and L may have any value. Then 

~ = TLclqx^ * “f* (Z/ "h l)aix^ -f* * * * 4*20 

ax 

= L(L — tyaox 1 ^ 2 -f* (Z/ H f* l)Laix L 1 -f- * * * 4*21 

ax 

If we substitute these series in equation 4*14, we find 

0 = (L 2 - l)a 0 x L + { (L + l) 2 - ijaix 1 ^ 1 

+ {[(L + 2) 2 — J]a 2 + ao}x L+2 + * * * 

+ {[(£ + v) 2 3-]^ + a y _2}x L+,; + * • * 4*22 

which leads to the series of equations 

(L 2 - fa = 0 

[(L + l) 2 - J] fll = 0 

{[(Z> + v) 2 — J]a„ + a„_ 2 } = 0 4*23 

The first of these is called the indicial equation. Since a 0 ^ 0 we 
must have L 2 = J, or L = The other equations then determine 

the other a’s in terms of ao. There are thus two solutions: 

K- V»‘{ 1 -|+i 5 + "] 4-34 

!/-<vr , ‘{l-!+^ + -.-] 4-25 

If we are interested only in the solutions of a differential equation 
which belong to the class Q, it is essential to investigate the behavior 
of the solutions at all the singular points of the equation, for if the 
exponent L is negative or fractional the function y is either infinite or 
multiple-valued and therefore cannot be of class Q. 

Infinite series are at best awkward things with which to work. For 
example, it is very difficult to derive the properties of the function sin x 
from the series 

sinx = * — - + - — - + 4-26 

whereas the properties are easily derived from the ordinary trigono- 
metric definition. For this reason the various functions which are met 
in quantum mechanics are defined by some other definition than the 
power series derived from the differential equation, and are then iden- 
tified with this series. 
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4b. The Legendre Polynomials. Consider the equation 
(1 — x 2 ) ~ + 2 nxy = 0 


4-27 


If we write this in the form 


dy__ 2ntdx 4 . 28 

y (l - x 2 ) 

it may immediately be integrated to give 

y = c(l - x 2 ) n 4*29 

where c is any constant. If we differentiate equation 4*27 (n + 1) 
times, the result is 


/, 2 \ dn+2 y „ dn+1 y . , , u dn y n 


4-30 


which may be written as 

(1 — x 2 ) ^4 — 2x + n{n + l)z = 0 
ax ax 

where 


z 


d^y 

dx n 




4-31 

4-32 


Equation 4-31 is known as Legendre’s equation. The particular 
solution 


* ” K(x) ‘ 2^! S ^ “ 1)m 


4-33 


is called the Legendre polynomial of degree n. [Po(x) is defined to 
be unity.] It is possible to show that these functions form a system 
of orthogonal functions in the interval — 1 < x < 1, and also that these 
polynomials are the only functions of class Q which satisfy equation 
4-31 in this interval. As these functions are special cases of a more 
general set of functions which we shall now discuss, we shall prove 
these statements only for the general case. 

4c. The Associated Legendre Polynomials. If equation 4-27 is 
differentiated (m + n + 1) times, we obtain the equation 


(1 - x 2 ) 


<T +n+ 2 y 

dx m+n+2 


— 2 (to + l)x 


<T +n+1 y 

dj.m+n+1 


(m + n + l)(n — m) 


dr +n y 

dx m+B 


0 4-34 
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which may also be written as 

d^z dz 

(1 — x 2 ) — a — 2 (m + l)x rr + (m + n + l)(n — m)z = 0 4*35 

dx z dx 


where 


d m+n ‘ 


z = 


y 


dx m+n dx m 


d m d m+n 

— Pn(x)=c—r-<l-x>r 


dx m+n 


Let us now put 


Then 

dz 

dx 

d 2 z 
dx 2 


du mxu 
dx 1 — x 2 


z = u( 1 — x 2 ) 2 

__ m 

(1 - X 2 ) 2 


4*36 


4*37 


d 2 u 2 mx du 

2 + ; 9 ~ h 


[dx 2 ' 1 — x 2 dx 
so that the differential equation for u is 
d 2 u 
dx‘ 


( m m(m + 2)x 2 \ | „ 2x - 

(j^? + (1-.V j“) (1 -^> 


(i - *’) 




4*38 


This equation is known as the associated Legendre equation, and the 
function u, denoted by u = P% (x), is called the associated Legendre 
polynomial of degree n and order m . From equations 4*36 and 4*37, 
we see that 


m m j m 

K(X) = (1 - X 2 ) 2 2 = (1 - Z?) 2 ^Pn{x) 


or, using 4*33, 


P7(x) = 


(1 - x 2 ) 2 d n+m , 2 


2"n! dx n+m 


(x 2 - 1)“ 


4.39 


4-40 


It is apparent that P$ J(x) = P„(x). Also, since P„(x) is a polynomial 
of degree n, P” (x) is zero if m > n. 

We shall now show that the functions P”(x) and PJ"(x) are or- 
thogonal in the interval — l<x<lifi?*n. The equations arising 
from 4-38 satisfied by these functions may be put in the form 


d_ 

dx 

£ 

dx 


r x) dx 


+ \n(n + 1) 


(1 - x 2 ) 


2\ dPT 


dx 


+ 


l (l + 1) 


A }' 7 - 0 


441 

442 
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0 

If we multiply 4-41 by PJ” and 4-42 by P“ and subtract, we obtain 

p: 


d j(i - x 2 ) (pr dP 


dx 


dx 


=S)1 

+ {n(n + 1) - 1(1 + 1)}P™P|" = o 4-43 

Upon integrating between —1 and +1, the first term vanishes because 
of the factor (1 — x 2 ), so that, if n ^ Z, we have 

/»+i 

J P%(x)P™(x) dx — 0 4*44 

In order to normalize these functions, let us now consider the integral 

•* +1 ^ +1 d m p n <rp n 


f {P”{x)] 2 dx = J ( 1-x 2 ) 

-f 


dx 


dx m dx 1 

l PnY m+l 

dx m dx 7 "- 1 X— i 

+1 d m ~ l P n d 




J _i dx”* 1 da: j da:”* J 


dx 4-45 


If 4-27 is differentiated m + n times, and the result multiplied by 
(1 — a; 2 )”* -1 , we obtain • 


(1 - a: 2 ) 




dx ”* +1 


- 2mx(l - a; 2 )” 


dx” 


jm—\p 

+ (n + m) (n — m + 1) (1 — x 2 )”* -1 = 0 4-46 

dx 


which is equivalent to 


d_ 

dx 


(1 - x 2 )”* 


dTP n 

dx” 


= — (n + m) (n — m + 1) (1 — x 2 )”* -1 - 4-47 

dx 

Substituting this result in 4-45, we find 
f + \p:(x)] 2 dx 


= (« + m)(n - m + 1) J* (1 - x 2 )” 1 1 dx 

= (n + m) (n — m + i )J^[PT\x)f 


' dx 


4-48 
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If this process is continued, we finally arrive at the result 

/T KM1* * “ frr^f! I" 

This last integral can be evaluated by means of the explicit expression 
4*33 for P n (x). We have 

/ +1 1 /»+l in in 

I p - W|2 * - Wft ^ - 1)n 5= (x ‘ - ir * 450 

Integrating by parts n times, this reduces to 

r +1 (~l) n /* +1 d 2n 

I rn /„\i 2 v x ) I / 9 i\n U 2 t\n 


~v= 3 ?f-, <** -»*<»»»* 


1 )"dr 




/.+! 

J (1 - ^ 


(1 + X) n <b 


)»dx=-^- r a - so-^i + x) b+i 

n + 1 J_i 

■■•i r +1 „ + rf . , iil a 2!-+ ‘ 

2) - - - 2 »J_, (2n)!(2n + l) 


- »(" - d • i r (1+ ,)^i 

(» + l)(» + 2)---2raJ_ 1 V ' 
equation 4-49 may finally be written as 

C +1 r™,/.,,!,. (» + *»)! 


J_ i K(x)] 2 


(n — m) ! 2n + 1 


, , i. w . /2n + 1 (n — m)! N 

so that the normalized functions are * / — - — - Pn\X). 

\ 2 (n + m)! n 

4d. The General Solution of the Associated Legendre Equation. 

Up to this point we have been concerned with particular solutions of 
Legendre's equations 4*31 and 4-38. We shall now show that these 
are the only solutions for the interval — 1 < x < 1 which belong to 
class Q . Since 4-31 is a special case of 4-38, we shall discuss 4*38 for 
the sake of generality; the results, of course, will hold for 4*31 as well. 

Equation 4*38 has two singular points, x = 1 and x = — 1. If we 
make the transformation x = £ + 1, the resulting equation is 


&—.L. 2 ‘ 

* d£ a + 2 


!“ + (- 


■n(n + 1) 


f + 2 ({ + 2) 2 


0 4-54 

? c' r 
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which is of the form 4-18, with 


p'ft) 


2(t + 1) 
f + 2 


«?'({) = -»(» + 1) 




ft + 2) 2 


4-55 


Since these functions are finite for £ = 0, the point £ = 0(x = l)isa 
regular point. If we now clear of fractions we obtain our equation in 
the form of 44 

<£* + 4f* + 4?) ^ + (2{> + 6£ 2 + 46) 

+ [ — n(n + 1)£ 2 — 2n(n + 1)£ — m 2 ]u = 0 4*56 


Substituting for w the power series 

u = a 0 £ L + a x £ L+1 4 4-57 


we obtain the indicial equation 

4L(L - 1) + 4L - m 2 = 0; L = ± ^ 4-58 

a! 

m 

If we take m real and positive the series beginning with £ 2 becomes 
infinite at £ = 0 in such a way that the square of the function is not 
integrable, and so the resulting function is not of class Q in a range 
including the point £ = 0 (x = 1). The solution which remains is, 
returning to £ as the independent variable, 

u = (x - 1)* (al + a[x-i ) 4-59 

A similar analysis of the other singular point shows that u must also 
be of the form 

u = (x + 1)? (<# + a'/x + • • • ) 4*60 

Now the function 

u = (1 - x 2 )? (6 0 + M + • • • ) 4*61 

to 

is of the form 4-59 if we expand the factor (1 + x) 2 and also of the 

to 

form 4-60 if we expand the factor (1 — x )*, so that the requirements 
at both singular points are satisfied if we put 

to 

u = (1 — x 2 ) 2 z; z = bo + bix + — 4-62 

The differential equation satisfied by 2 is 

(1 — x 2 ) — 5 — 2 (m + l)x — + [n(n + 1) — m(m + 1)]2 = 0 4-63 
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giving the recursion formula for the coefficients in 4-62 

{v + 1)(f + 2)6„+ 2 = [v(v - 1) + 2 (m + 1> 

— n(n + 1) + m(m + 1)]6„ 4*64 

If n — m is an integer, we find that, for the particular case in which 
v = n — m, &n~m +2 is zero, and thus the alternate coefficients 
bn-w+e, * * • are also zero. Therefore if the solution is written in 
the form 

t /, , m(m + 1) - »(» + 1) 2 , \ 

z = b 0 ( H ) 

. , ( , (m + l)(w + 2) - w(n + 1) 3 , \ 

+ h ( X H — 3T H j 4-65 

either the even series or the odd series degenerates to a polynomial 
of degree n — m. The complete solution of 4-38 is therefore 

m 

u = (1 - x 2 )*z = AP%(x) + BQZ(x) 4-66 

where Q™(x) represents the series which does not terminate. The 
terminating series must, of course, be the polynomial P% (x) which we 
have already studied. Since 

h +2 v(v — 1) + 2 (m + l)r — n{n + 1) + m(m + 1) 

__ » ______ . 

lim = 1 4 67 

V — »00 &V 

the series for Q™ converges for — 1 < x < 1, but at the points x = ±1 
the series becomes divergent. The function [Q„ (x)] 2 is therefore not 
integrable over the range — 1 < x < 1 and hence is not of class Q in 
this range. We therefore see that, aside from a numerical factor, 
P%(x) is the only function of class Q which satisfies equation 4*38 in 
the range (—1, 1). If n — m is not an integer neither series ter- 
minates. The same considerations which were applied to the function 
Qn apply here to both series, so that there is no solution of 4*38 which 
belongs to the class Q . 

4e. The Functions 0/ >m (0) and 7^ m (0, <p). We will now establish 
the connection between the functions 0*, m (0) introduced in the pre- 
vious chapter and the functions Pn(%) discussed above. We there 
derived the results: 

M 2 Fj, m = 1(1 + 1) Yi.m 4-68 


4-69 
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Yi m was found to be equal to ■ ; — 0/ m (0)e vm<f> y where m is an integer. 

v 2w 

Using for M 2 the expression in spherical coordinates (3-99), we obtain 
as the differential equation for Oi, m (0) 


1 d_ 
sin 6 dd 


( . A dQl m\ 

I g m Q I 

V do ) 


m 

sin 2 0 


Qi t m + Hl+ l)0i, « = 0 4 70 


or 


d 2 Qi t m COS 0 dBi t m 

dd 2 sin 6 dd 


1(1 + 1 ) - 


m 2 | 
sin 2 6\ 


01. m = 0 


4-71 


In order to be an acceptable solution, 0/, m (0) must be of the class Q 
over the range 0 < 6 < t. Let us now make the change of variable 
x =» cos 6. Then 


dd 


—sin 6 


A 

dx 


d 2 

dd 2 


— sin 2 6 


dx 2 


— cos 6 


A 

dx 


4*72 


and equation 4*71 becomes 


(1 ~ x 2 ) 


d 2 Ql, m 
dx 2 


- 2x 


dOj, m 
dx 


+ 


1(1 + 1 ) - 



= 0 


4-73 


Expressed as a function of x } Oi f m must be of the class Q over the range 
~ 1 < x < 1. Comparing 4*73 and 4*38 we see that 0i ffn must be 
identified with u = P™. Since m enters equation 4*73 only as m 2 , 
we must have 0/, m = 0j, _ m . The exact identification is, therefore, 
for the normalized functions: 


Ol,m(0) 


-4 


2 1 + 1 (Z 


\m. 


2 (Z + \m 


)! 


)! 


^Pj m| (cos0) 


4-74 


where |m| indicates the absolute value of m. Further, since m is an 
integer, and since equation 4-38 or 4*73 has an acceptable solution only 
if l — m is an integer, l must be integral valued. From 4*40, we see 
that the explicit expression for 0j, m (0) is 


©I. m(0) 


(-1)* / 2Z + 1 (Z- |m|)l . w rf+W(sin«g) 
2 Hi \ 2 (Z + |mj)! Sm (dcos0) I+w 


4-75 


The- explicit expressions for the normalized associated Legendre poly- 
nomials are given in Table 41 for 1 = 0, 1, 2, 3. The normalized 
spherical harmonics Yi, m (6, <?) are obtained by multiplying Oi, m (6 ) 

by 4 > m (<p) = -j= e imv . 

V 2 TT 
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TABLE 4-1 

The Normalized Associated Legendre Polynomials Bj, m (0) 


1 = 0, 

m 

= 0 

o 

o 

0 

Vi 

1 “ 1, 

m 

- 0 

Bi, o 

= v/ |- cos d 

1 = 1, 

m 

*5= ±1 

Bi, ±i 

* ‘ v/ f sin d 

1 = 2, 

m 

SB 0 

B2, 0 

= V ^(3 cos 2 0 - 1) 

1 = 2, 

m 

= dbl 

®2, ±1 

= ' s/ ^sin0cos0 

1 = 2, 

m 

- =b2 

B2, ±2 

= v ^£sin 2 0 

1=3, 

m 

= 0 

B 3 , 0 

= (f cos 4 * * * 8 0 — cos 0) 

1 = 3, 

m 


B 3 , ±1 

- V'fi (5cos 2 0 - 1) sin 0 

1 = 3, 

m 

- ±2 

B 3 , ±2 

= sin 2 0 cos 0 

1 = 3, 

m 

- db3 

B 3 , dh3 

= sin 3 6 


4f. Recursion Formulas for the Legendre Polynomials. In our 

later work, we shall have occasion to evaluate integrals of the form 



cos 0 Or, m ' dr 


and 



sin 0 m » dr 


The evaluation will be greatly simplified if we have available explicit 
expressions for the quantities cos 0 P| w *(cos 0) and sin 0 Pl m| (cos 0) 
in terms of a series of Legendre polynomials. From the formula 

4 ' 76 


it can readily be shown that the equations 

(21 + 1 )Pj = j- (Pi+i ~ Pi- 1) 4-77 

ax 

and ( l + 1 )P t = £ p w~ x £ p i 478 


are valid. If 4-77 is differentiated to times and the resulting equation 

m-H 

multiplied by (1 — a: 2 ) 2 we immediately have, using the definition 
4-39 for the associated Legendre polynomials, 

(21 + 1)(1 — = Pj+i 1 — fTli 1 4-79 


Differentiating 4-78 (to — 1) times, we have 
dx m ~ x dx m dx m 


(to - 1) 


cT-'Pi 

dx m ~ l 


4-80 
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m 

Multiplying this expression by (21 + 1)(1 — x 2 ) 2 and rearranging 
gives 

(2 1 + 1 )xP? = (21 + l)PT+\ ~ (l + m){(2l + 1)(1 - a^pp- 1 } 

= (21 + l)Pj+i - (1 + m)\P? + x - FT-x\ 

= (Z ~ m + l)P|+i + (Z + m)PiLi 4*81 

If equation 4*27 is differentiated ( m + l) times, the resulting equation 
for P\ is 

i r n + 1 P 1 dPPi 

(1 " * 2) ^ - ^-^+(l + m)(l -m + 1)-^ = 0 4-82 

m 

We now multiply by (21 + 1)(1 — a; 2 ) 2 and rearrange, obtaining 

(21 + 1)(1 - x 2 ) H PP +1 

= 2m (21 + 1 )xP? -(l + m)(l-m + 1) (21 + 1) (1 - x^FT 1 4-83 

Substituting for the terms on the right their values as given by 4-79 
and 4-81, we obtain 

(21 + 1)(1 - x 2 ) h F? +1 = Pftj{ (m - 1)(1 - m + 1)} 

+ Pjli{ (1 + m) (1 + m + 1)} 

or, replacing m by (m — 1), 

(21 + 1)(1 - x 2 ) H PT = -(l-m + l)(l-m + 2)PT+i 

+ (l + m)(l + m- 4-84 

Expressing equations 4-79, 4-81, and 4 84 in terms of cos 6 = x gives us 
the desired relations 

cos 0Pj m| (cos 0) = - { (1 - |m| + l)P]+'i(cos 0) 

+ (1 + h|)^l-i (cos0)} 4-85 

sin 0 P| m| (cos 0) = — {P|^ +1 (cos 0) - PjP'i+^cos 0) 4-86 

ZL *f“ 1 

= 2Tp[ i ^ + hl)(* + hi - D^'r^cos 0) 

- (I- hi + l)(i - hi + 2)Pft l f 1 (cos0)} 4-87 
4g. The Hermite Polynomials. Consider the equation 

1 + ^.0 


4*88 
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The solution of this equation is readily seen to be 


y - aT* 

4-89 

If we differentiate equation 4-88 (n + 1) times, we get 


d 2 z , _ dz , 

- +2x - + 2(n + l)z = 0 

4-90 

where 


dry dr _ x , 

dx n dx* K ’ 

4-91 

z is a function of the form u(x)e~ x \ where u(x) is a polynomial of 
degree n. Substituting this expression for z in equation 4*90 we find 
that u(x) satisfies the equation 

d 2 u „ du , _ 

-T-s — 2x — -f- 2nu = 0 
dx dx 

4-92 

Equation 4*92 is known as Hermite’s equation, and the particular 
solution obtained by putting c = (— l) n is known as the Hermite 
polynomial of degree n. The usual symbol for these polynomials is 
H n (x)) explicitly 

H,( x) = (-l)V*^ (<T*’) 

4-93 


The first five Hermite polynomials are: 

H 0 (x ) = 1 Hi(x) = 2x 

H 2 (x) = 4x 2 — 2 H 3 (x) = 8a; 3 — I2x 

Ht(x) = 16a; 4 - 48x a + 12 4-94 


In general 
H n (x) = (2*)* 


n(n - 1) (2a;)" -2 
1 ! 


+ 


n(n — 1) (n — 2) (n — 3) (2x)" 
2 ! 


+ 


4-95 


Differentiating this series term by term, we find 


dH n(g) 

dx 


2n 


(2x) 


n — 1 


( n — l)(n — 2) 
1 ! 


(2x) 


n— 3 


= 2nHn-i (x) 


4-96 
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Differentiating once again, we find 

d 2 HJx) n dH n — i (%) 


= 4n(n — 1 ) // n 2 


If we now substitute 4-96 and 4-97 in the differential equation 4-92, we 
obtain the useful recursion formulas 

4n(n — 1)7 / tv — 2 — inxH^i + 2 nH n = 0 

4-98 

xHn- 1 = (n — l)//n — 2 + 

or, upon replacing n by n + 1, we obtain the relation analogous to 
equation 4-85 

xli n = nH „- 1 + \H n+ i 4-99 

We shall now show that the functions H n (x)e * form an orthogonal 
set in the interval (— oo , <x> ). Let m be less than n; then 

/ °° s'* d n (e~ xi ) 

H m (x)H n (x)e~*'dx = J m H rnM~^~dx 4- 100 


Integrating by parts, we have 

, r rr , X 

^ J dx — dx n ~~ l J 


- f 4-101 

J - oo dx dx 11 

The first term on the right is zero, since e~ z<1 and all its derivatives 

vanish at x = db « . Replacing ^7^ by 2 mHmr-u we have 

a# 

J* H m (x)H n (x)e~ x ' dx = (- l) B+1 2m J (e~**) dx 4-102 

Repeating this process, we finally obtain 

/ oo ✓»«> jn m 

H m {x)H n {x)e- x ' dx = (— l) n+m 2 m rn! J ^ (e~ l2 ) & 

r jn—m—l “loo 

= (-l) n+m 2 M ^![^=i ^ = 0 4-103 

If m = n, the same process leads to 

f [/f B (x)] 2 da: = (— l) 2n 2”n! f e~ zt dx = 2 n n\^ 4-104 

•/— Q0 t/ — 00 
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1 _*? 

We have thus proved that the functions 7=— H n (x)e 2 form a 

(2 n n!V w )X 

normalized, orthogonal set. 

We must now show that these polynomials are the only solutions of 
equation 4-92 belonging to the class Q for the range (— 00 , 00 ). As 
there are no singular points in this equation (except x = ± 00 ), we 
may expand our solution about any point. Expanding about the 
origin, we find the recursion formula for the coefficients in the series 

u = a 0 + a\X + a 2 x 2 + * • • 4*105 

to be ( v -f 2) , (v *-f- l)a^_f _2 "h (2w — 2v)a„ = 0 4*106 

so that the solution is 

u = a 0 (l — nx 2 + ■ — x 4 -| ) +«i(x - — - ^ x 3 H ) 4-107 

o 6 


Since lira 

*r— ♦ 00 


a. 


- , the series always converges. In the series 
v 




!+** + - + 


+ 


x 2 * 

v\ 


4-108 


1 a 2 

a v = 7 - 7 7 7 ; f or v even; a v = 0 for v odd, so that lim = -- • We 
(j'/2)! a„ v 

therefore see that the series 4*107 behaves like e x2 near x = ± 00 . 

But neither e 2x2 nor e* is integrable from — 00 to 00 , so that neither u 

nor ue 2 is a function of class Q. If n is a positive integer, one of the 
two series in 4*107 reduces to a polynomial, which will be a constant 

j* 

times the particular solution H n (x). Then, as we have seen, H n (x)e 2 
is quadratically integrable and hence is a function of class Q. 

4h. The Laguerre Polynomials. The polynomials L a (x), of the 
ath degree in x f are defined by the equation 


LJx) = e*£- a (x“e~*) 


4-109 


and are known as the Laguerre polynomials. The /3th derivative of 
L a (x) } called an associated Laguerre polynomial, is denoted by L%(x). 
The general formula for the associated Laguerre polynomials is 




, «(a - 1)(« - £)(« - 0 - 1 ) _ a _ s _ 2 , 

+ 2! X +••• 


4-110 
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The Laguerre polynomials are not orthogonal functions, but the fime- 

tions e~*L a (x) form an orthogonal set for the interval (0, «). In 
order to prove this, consider the integral 

'• d a 


f e~ x x y L a (x ) dx = f x y — (x a e x ) dx 
J o V 0 dX 

If we integrate by parts y times, we find 


4111 


dT^ 
dx^ 


(x a e z ) dx 


(x°e-*)Y = 

Jo 


0 4112 


C e x x y L a (x) dx — (— 1) 7 7! f 

Jo J 0 

For y < a, this gives 

while for 7 = a we have 

f e~ x x*L a (x) dx = (-1 ) a ct\ f x a e~ x dx = (-l)“(a!) 2 4*113 

Jo Jo 

Since L y is a polynomial of the 7th degree, 4- 112 gives 

f e- x L y (x)L a (x) dx = 0 y <a 4*114 

Jo 

or, exchanging the roles of a and 7, 

f e~ x L 7 (x)L a (x) dx = 0 y j* a 4*115 

The highest power of x in L a (x) is (— l) a x a , so that 

f e^L^x)] 2 ^ = (-1)* f e^ x x a L a (x) dx = [a!] 2 4*116 

«/o ^0 

1 

The functions , e 2 L a (x) therefore form a normalized, orthogonal set. 
a! 

To show the orthogonality properties of the associated Laguerre 
polynomials, we consider the integral 

J r** r * $ 

e^Li dx « / <T x x? -7-3 L a dx 
0 Jo dx? 

Integrating this by parts times gives 

£ e~*x y Li dx - (- 1)* f L a (e~*x y ) 


dx 


4117 


4-118 


or 

as the integrated part always vanishes. The first term in (e~*x y ) 
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is ( — l) p e' x x y . For y < a, we therefore see from 4-112 that 


f e x x y L% dx = 0 

Jo 


Now the function is a polynomial of degree 7 , so that we have 
the result 


J* 00 

e x x^L^Li dx = 0 7 < a 

0 


or, since a and 7 may be exchanged 

j *°° 

e~ x x&LyLadx = 0 7 a. 4-120 

0 

( — 1 )«qj! 

For 7 = a, the first term in is 7 ; x a . We therefore have, by 

(« - 0 ) * 

4-118 and 4113, 

f dx = f dx 

*/o (a — p)i J 0 

C-ira! /•" (a!) 8 

= -^4 / e~ x x a L a dx = - - 4-121 

(a -0)1 Jo (« -£)! 

The functions <*/ — 0 *** therefore form a normalized orthog- 

\ (a!) 3 

onal set in the interval ( 0 , °o ). 

In the theory of the hydrogen atom we will need the value of the in- 

J * 00 

e^x^L^L^ dx. By expanding a^ + 1 L£ by means of 4-110, 
0 

and retaining only the terms in x a+1 and x a , since lower powers of x 
will integrate to zero, we have 

f e~ x x p+1 LiL B a dx = { f e~ x x a+1 L B a dx 

— a{a - 0) J' e~ x x a L B dxj 4-122 

Using 4-118, and retaining the first two terms in the expansion 6 f 
(e~ x x a+1 ), this reduces to 

f. - ^i{X‘ 

- [p(a + 1) + <*(« - P)] f e~*x a L a dx 4-123 

«/o i 
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The last integral has the value (— l) a [a!] 2 , according to 4413. For 
the first integral, we have 

/•°° r* d a 

I e^x^La dx = / x a+1 — (x*e~ x ) dx 
J o «/ o dx 

or, after integrating by parts a times, 

f e-*x a+1 L a da: = (— l)“(a + 1)! f x a+1 e^dx 

«/ 0 0 

= (-1)“[(« + l)!] 2 4-124 

Substituting these values in 4-123 gives us the final result 

f e- x xt +1 L B a L e a dx = { (a + l) 2 - [/?(« + 1) + «(« — 0)]} 

J 0 (a — p) l 

(2a — /3 + l)(a!) 3 




4 125 


In order to find the differential equations satisfied by the Laguerre 
polynomials and their derivatives, consider the equation 


x< ^+ (x - a)y = 0 


4 126 


4127 


which is satisfied by y = x a e x . If we differentiate this equation 
(a + 1) times, we find 

d 2 z dz 

x d? + {x + 1) fc + (a + 1)2 = 0 

where 

2 = B - S ^ 

Substituting this value of z in equation 4 127 gives us the differential 
equation for the Laguerre polynomials 


d 2 L a 
dx 2 


+ (1 — x) - + aL a = 0 
ax 


and differentiating this equation times gives 


4-128 


4-129 


d?L a 

where u = = Lf as the differential equation for the associated 

Laguerre polynomials. 
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We shall now consider the general solution of this equation. The 
point x = 0 is a singular point, but it is regular, so that we take as our 
solution the series 

u = a 0 x L + a\x LJrl + • • • 4-130 

Substituting this series in 4- 129 gives the indicial equation 
L{L + /3) = 0 L = 0, L = — (3 

The series beginning with x~ ^ cannot be of class Q in any region in- 
cluding the point x = 0; therefore there remains only the series 

U — (Xq -f- tt\X -f* 0>2X^ -f- * * * 4*131 

The recursion formula for the coefficients is readily found to be 

O' + (3 + l)(v + l)a y+ i = (v + fi — a)a; 4*132 

so that 



V— >30 tty V 


The series always converges, but, as v — » « , the limiting ratio of the 

coefficients is the same as that in the series expansion of e x . Hence 

_fc fi 

the function u is not of class Q, nor is e 2 x 2 u unless the series terminates. 
This is possible only if a — (3 is an integer greater than zero, in which 
event we have a constant multiple of Lf. Where we shall use this 
function, p will be a positive integer, so that a must be an integer 
greater than P if there is to be a solution of class Q. 
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THE QUANTUM MECHANICS OF SOME SIMPLE SYSTEMS 


In this chapter we propose to treat by the methods of quantum 
mechanics certain of the simple systems for which the Schrodinger 
equation can be solved exactly. These systems are an idealization 
of naturally occurring systems, but the consideration of them is not 
without value, as they furnish an insight into the methods of quantum 
mechanics and give results which are useful in the discussion of many 
problems of physical and chemical interest. 

5a. The Free Particle. The simplest imaginable system would be 
a particle of mass m moving in the x direction under the influence of 
no forces. The classical Hamiltonian function for this system is 



6-1 


where the value zero has been chosen for the constant potential V. 
The eigenvalues E x for the energy are given by the solution of the 
equation 


h i 2 dV 

8t r 2 m dx 2 ** 


5-2 


A possible solution of this equation is 


2*i 

i = N(E x )e >> 


5-3 


In order that the wave function remain finite at x — ± °o , the quantity 
v 2mE x must be real, and so E x must be positive. As this is the only 
restriction on E x , we conclude that all possible values of E x from 0 to 
+ oo are permissible; that is, we have a continuous spectrum of energy 
values. Classically, the energy is related to the momentum by the 


relation E x = — 
2m 


pi, so that in terms of the momentum we may write 


i = N(p x )e 


54 


Let us now calculate the average value of the momentum associated 

68 
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with the wave function 5-3. According to equation 3-68, this is 



5*6 


An equally valid solution of equation 5-2 is 


vf- = N (E x )e = N(p x )e * 




5-6 


The average value of the momentum associated with this wave function 


is -V2. The wave function 54 therefore represents a state in which 
the particle is moving in the +x direction with the definite momentum 
Vpf; the wave function 5-6 represents a state in which the particle 
is moving in the — x direction with the same absolute value of the 
momentum. Let us now consider the probability that the particle, 
in the state represented by 5-3, will be in the region between x and 
x + dx. According to Postulate I, this is 4 / *4' dx — N*N dx. We 
thus see that all regions of space are equally probable, so that the un- 
certainty in the position of the particle is infinite. This is, of course, 
required by the uncertainty principle, for, if Ap x is zero, as we have 
found it to be, Ax will be infinite in order that the relation A p x Ax ~ h 
will be true. Any linear combination of the solutions in 5*3 and 5*6 
involving the time 


* = N(E x )e h 


2x% V2mE x x - t 


5’3a 


* - N(E x )e 


2xt 

h 


e 


2xiE x 
h 1 


5-6a 


is also a solution of the wave equation 

h 2 d 2 \ j/ h d 

8w 2 m dx 2 2 iri dt 


If we take the combination representing the sum of all possible wave 
functions 5*3a and 5-6a, the waves will reinforce one another at some 
particular point x = x 0 and will interfere everywhere else, so that in 
this event the particle can be located exactly. However, the uncer- 
tainty in the momentum is infinite. A more complete analysis 1 shows 
that the smallest possible simultaneous uncertainties in position and 
momentum are governed by the relation Ap x Ax ~ h; in this way 
the uncertainty principle is derivable from our fundamental postu- 
lates. 

1 R. C. Tolman, Principles of Statistical Mechanics , Oxford, 1938, p. 231. 
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5b. The Particle in a Box. We now consider a particle of mass m 
constrained to move in a fixed region of space, which for simplicity we 
take to be a rectangular box with edges of length a, 6, and c and volume 
v = abc. The potential V may be put equal to zero within the box; 
at the boundaries of the box and in the remainder of space we put 
V = oo . We take our coordinate system to be the cartesian coordinate 
system with the origin at one corner of the box and the x , y , and z 
axes along the edges of length a, b , and c, respectively. The potential 
energy is then 


V x = 0, 0 < x < a; V x = oo otherwise 

V v = 0, 0 < y < b; V y = oo otherwise 

V z = 0, 0 < z < c; V z = oo otherwise 

Schrodinger’s equation for the system is 

d 2 \l/ t d 2 \[/ d 2 \p . 8ir 2 m 

dy 2 ^ dz 2 


ax‘ + l? + li‘ + '~W (E 


V)f = 0 


5-7 


We have a differential equation in three variables. In order to solve 
this equation we seek a solution of the form 


* = X(x)Y(y)Z(z) 5-8 


where X (x) is a function of x alone, etc. If we now put V = V x + F„ 
+ V z , and substitute 5-8 in 5-7, we obtain 


1 d 2 X(x) _1_ d 2 Y(y) 
X(x) dx 2 + Y (y) dy 2 


+ 


8 r 2 m 


{E-V x 

1 

m 


-Vv) 

d 2 Z(z) 

dz 2 


+ 


8 x 2 m 

IF * 


5-9 


On the left we have a function of x and y only; on the right, a function 
af z only. If we vary x and y, keeping z constant, the right side re- 
mains constant; therefore the left side must be a constant. Let us 

sail this constant ^ o m E z . We then have the equations 




d 2 Z ( 2 ) 8 ir 2 m 

W + — (E. - V,)Z(z) = 0 


dz 2 




5-10 


l d 2 x(x ) 

f(x) dx 2 


, 8 T 2 m _ 

H — ^2 {E — E z — 7*) 

1 d 2 Y(y) f 8^m jr 
Y(y) dy 2 + h 2 


511 
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By the same reasoning followed above, both sides of this last equation 
must be equal to a constant, which we call ~~j- E y . This gives us the 

tv 

additional equations 

d 2 Y(y) 8ir 2 m ir 

— T5- + l 2 ( E v ~ V u )Y(y) = 0 5-12 


dy z r 

d 2 X(x) 
dx 2 h 2 


V x )X(x) = 0 


where, in 5*13, we have put E = E x + E y + E z . We thus have three 
differential equations, each involving one variable only, and which are 
thus readily solvable. Let us consider first the equation in x. Since 
V x — oo for x > a, x < 0, 5-13 will hold in this region only if we put 


X(x) = 0, 


d 2 X(x) 


= 0. For the region inside the box the equation is 


d 2 X(x) 


[ E x X(x) 


for which the general solution is 


IW - X.¥'® r - + Be - “ vssr - 


X(x) = A f cos V 2 mE x x + B' sin "■ V 2 mE x x 5-16 

h h 

In order that the solution for the region inside the box join smoothly 
with the solution for the region outside the box, we must have X(x) — 0 
at x = 0, x = a. The first of these considerations requires that 

A' = 0; the second requires that mE x a, = n x tt, where n x is an 

h 

integer (not including zero, as this value of n T would make X(x) equal 
to zero everywhere). This limitation on the nature of the wave 
function at the edge of the box requires that the energy be quantized; 
that is, only those values of the energy given by 

fi 2 h 2 * 

Ex = 7T~ — o n x — 1) 2, 3, • • • 5-17 


will give an acceptable wave function. This wave function may now 
be written as 

X(x) = B' sin ^-x 5-18 

a 
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The normalization factor is given by the integral 

f“ [X{x)] 2 dx = B' 2 f“ sin 2 x'jdx = 1 5-19 

/ /2 

which gives B = * The equations in y and z are solved in the 

\ G 

same manner. The final results are therefore 


*(*, y,z)~X(x)Y(y)Z(z) 


8 . n x 7T . TlyTT . 7l z 7T 

— sm — x sm — y sm — z 
abc a b c 


8 . n x w 

\h sm — 

\v a 


TlyTT . Tl z TT 

x sm — ■ y sm — z 
b c 


E = E x + E y + E 2 


__ tf_/n 
8m \a 


2 2 \ 

K . K\ 


± JL. OL + 11 
2 ' ' _2 


where w*, n v , w, = 1, 2, 3, 4, • • • . These results will be of importance 
in connection with the theory of a perfect gas, and also in the theory of 
absolute reaction rates. 

6c. The Rigid Rotator. The theory of the rigid rotator in space yill 
be of value in the discussion of the spectra of diatomic molecules. ‘As 
an idealization of a diatomic molecule, we consider that the molecule 
consists of two atoms rigidly connected so that the distance between 
them is a constant, R. As we are not interested here in the transla- 
tional motion of the molecule in space, we may regard the center of 
gravity as fixed at the origin of our coordinate system. Suppose that 

m 2 

the polar coordinates of one atom are a. 0, <p, where a = R. 

mi + m 2 

and mi and m 2 are the masses of the two atoms. The polar coordinates 

of the other atom are then b. 6, where b = — R. The kinetic 

mi + m 2 

energy of the first particle is 

?[(f) ,+ 0r) !+ (f )>TW + ^y] 532 

Similarly, the kinetic energy of the second particle is 

so that the total kinetic energy is 

m WllO 2 + to 2 6 2 f (d6\ 2 , ^_oJdtp \ 2 ' I 


[©'*-©1 
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If we put mi a 2 + mjb 2 = I (the moment of inertia), then the kinetic 
energy may be written as 

'-[[©'*-■•(81 


which is the same as that of a single particle of mass I confined to the 
surface of a sphere of unit radius. From the expression for the La- 
placian operator in spherical coordinates (Appendix III) we see that the 
Hamiltonian operator is 

_ ~h 2 n a / 9 d\ . 1 d ( . 3\ 


ri e ( 9 d\ 

1 d t 

( . d\ 

-5 -(r 2 - 

-f- o . 1 

. sm 9 — ) 

j 2 3r\ 3r/ 

r 2 sin 9 39 ' 

V 99) 


+ ~2 : 

r sur 


1 dH 

in 2 6 d<p 2 J 


+ V 5*25 


If no forces are acting on the rotator we may put V = 0, and, putting 
r = 1, mr 2 = m — I, we find that Schrodinger’s equation is 

Id/ dA 1 d 2 ^ 8 tt 2 IE , ^ 

— ( sin 9 — ) + n o H To — — 0 5*26 

sin 6 36 \ 36/ sm 2 6 3(p 2 h 2 

We have once again a differential equation with more than one inde- 
pendent variable, so that we look for a solution of the form 

\P = 0 ( 0 )$(*>) 5*27 

After making this substitution we may write equation 5-26 as 


d / d0\ 

— ( sm 6 — J 
36\ 36/ 


St 2 IE 


surf 6 = — 


1 d 2 $ 


By the same line of reasoning as was employed in the previous section 
both sides of this equation must be equal to a constant, which we take 
equal to m 2 . We thus obtain two differential equations 

d 2 <f> 

“7? = —m 2 $ 5*29 

d<P 2 


1 39/ 39\ 

1 sm 6 — ) 

sin 6 36 \ 36/ 


m 2 n , 8tt 2 IE „ n 

: 2 „ ® 4 To — 9 = 0 

sm 2 6 h 2 


Equation 5-29 is seen to have the solution 

$(<p) = ce ±im>e 5-31 

We have previously shown (section 3h) that this is an acceptable wave 
function if m is an integer; the normalized solution of 5-29 is therefore 

• _ IK l* I 1*111 I 1 — . n — u OK 


= $±m(<p) = 


m — 0, 1, 2, 3 
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4 . : 

In equation 5*30, let us replace 


Sir 2 IE 

h 2 


by 1(1 + 1). 


becomes 


This equation then 


1 8 ( n de \ 

sin 6 66 \ 66 / 



0 + i(i + 1 ) e = 0 


5*33 


which is identical with equation 4-70, and therefore has acceptable 
solutions only for integral values of l; l > \m\. The normalized 
solutions of 5*33 are therefore 


- e, ±mW = •) 5-34 


The restriction on l requires that only those values of the energy given 
by the relation 

E = ~Si l{l + 1) l = 0 ’ 1)2>3 " 5-35 


are allowed. The total wave function is, of course, 

^ = &l, ±m(Q)$±m(<P) = Y l, ±m(0, *p) 

As we have seen in sections 3h and 4e, these w r ave functions satisfy 
simultaneously the equations 


M 2 Fj, ±m = 1(1 +l)~ 2 Yi, ±m 

= dzTfl Y i t 


5*36 


In addition to being eigenfunctions of the Hamiltonian, with the eigen- 

h 2 

values for the energy E = 1(1 + 1), these functions are also 

eigenfunctions of the total angular momentum, with the eigenvalues 
h 2 

M 2 = 1(1 + 1) and of the z component of the angular momentum, 


with the eigenvalues M z = ±m — . All these variables thus have a 

constant value simultaneously. In general, when only two coordinates 
are needed to describe a system, we would expect only two dynamical 
variables to be simultaneous eigenvalues. The reason that we have 
three here is that classically there is a relation between the energy and 
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M 2 


21 


the square of the total angular momentum, this relation being E 

f 

which is also true for the eigenvalues given above. 

5d. The Rigid Rotator in a Plane* ,»As a special case of the above 
problem, let us restrict the motion to dotation in a given plane, which 
we may without loss of generality take to be the xy plane. In this 
event, 0 has the constant value of 90°, so that equation 5*26 becomes 

<k 2 ~ h 2 * 5-37 

Comparing this with 5*29 we see that the eigenfunctions are f 

K 2 


with the energy eigenvalues E 


8 7T 2 I 


mr . These eigenfunctions foi 


the energy are also eigenfunctions for M z , with the eigenvalues ±m 


2x 


6e. The Harmonic Oscillator. Many systems of interest can be 
approximated by harmonic oscillators; for example, the vibrations of a 
diatomic molecule and the motions of atoms in a crystal lattice can be 
treated to a first approximation as motions of a particle in a harmonic 
field, A harmonic oscillator is a particle of mass m moving in a straight 
line (along, say, the x axis) subject to a potential V = \ kx 2 , so that 
the force on the particle is —kx. Classically, the equation of motion is 

d?x 

m ~ o' — —kx 5*3£ 

dt 2 

with the general solution 

x — a cos 2 Trv(t — to) 5*3S 


where a and to are constants and 
*2 


JL /* 

" ~ 2x \m* 


The kinetic energy is 


i ( dx \ 


2rmr 2 v 2 a 2 sin 2 2irv{t — to); the potential energy is \kx 


2 _ 


2mTCv 2 or cos 2 27rv(t — t 0 ) } so that the total energy is E = 2m7rVa‘ 
k 

- a 2 ; all positive values of E are allowed. 

The classical Hamiltonian for the system is 

so that the Hamiltonian operator is 

h 2 d 2 . k - 


2 „ 2„2 = 


5-40 
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The wave equation for the system is therefore 
dV . 8ir 


dx 2 


lir 2 m ( k ,\ 


0 


5-42 


and our problem is to find those functions of class Q which satisfy this 
equation. This equation may be rewritten as 

^ + (« - A 2 )* = 0 


543 


. 8 T 2 m _ 2i r\/rnk 

where a = — j- E, 0 = — - — . 


We can further simplify this 


equation by making the change of variable £ = V(5x. Then — 2 = 

ax 

d 2 

0 — 2 , and the equation in £ becomes 


d$ 


2 + 


( I 


544 


Let us first see what form we must take for ^(£) for very large values 
of £ in order that we may have an acceptable wave function. For 

sufficiently large values of £, ~ may be neglected in comparison with 

0 

£ 2 , so that in this region ^(£) must approximately satisfy the equation 


d £ 2 4 v 


5-45 


The solutions of this equation are approximately f = ce 2 , since 
d 2 ± £ 

— (e 2) = e 2 (£ 2 ± 1), and the factor ±1 may be neglected in 

comparison to £ 2 in the region of large £. We cannot use the solution 
+1? 

e 2 , as this is certainly not of class Q ; the solution e 2 will, however, 
behave satisfactorily at large values of £. These considerations sug- 

gest that a solution of 5-44 of the form ^(£) = u(£)e 2 might be found. 
Making this substitution, we find that w(£) must satisfy the differen- 
tial equation 

d 2 u 
d? 


- l^w - 0 5-46 

Comparing this with equation 4-92, we see that the two equations are 


du , 
2t + 
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— 1^ by 2 n. w(£) is therefore ff n (£), and 
_L* 

the wave function <K£) is cH n (£)e 2 , which, as we have seen, is a 

function of class Q for all positive integral values of n, including zero. 
This restriction on n gives us a corresponding restriction on E. We have 

- = 2n + 1 5-47 

P 

or, substituting the full expressions for a and ($ and simplifying 

E = T-J- (» + *) = (» + 4)^ 5-48 

27T \ Wl 

The allowed energy values are thus f • • • times the energy Ay 
associated with the classical frequency of oscillation. 

We now need to find the constant c such that 

f* dx = [ff»(€)] 2 «" e, « = 1 5-49 

<T$ 

From equation 4-104, we see that we must take c = —====.' The 

V2 "uIVt 

normalized wave functions for the harmonic oscillator are therefore 

/V -1* r 

MS) = #•(«)« 2 5-50 

The first few energy levels and the corresponding wave functions are 
shown graphically in Figure 5*1. We note that the wave functions 
are alternately symmetrical and antisymmetrical about the origin. Of 
particular interest is the fact that, on the basis of quantum mechanics, 
the harmonic oscillator is not allowed to have zero energy, the smallest 
allowed energy value being the “ zero-point ” energy § hv. This is in 
accord with the uncertainty principle; if the oscillator had zero energy 
it would have zero momentum and would also be located exactly at the 
position of minimum potential energy. The necessary uncertainties 
in position and momentum give rise to the zero-point energy. A sim- 
ilar situation exists for the particle in a box, which also has a zero-point 
energy. In the rotator, the lowest state corresponds to the situation 
in which all orientations in space are equally probable; the uncer- 
tainty in position is therefore infinite, hence the momentum, and 
therefore the energy, may have the precise value zero. 


identical if we replace 


(i 
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Let us now calculate the average value of the displacement of the 
particle from its equilibrium position. We have 



10 8 X (cm.) 


Fig. 5-1. Energy levels and eigenfunctions for the harmonic oscillator. 

The recursion formula 4-99 for the Hermite polynomials states that 
£#n(£) = nHn- 1(£) + !#n+l (£) 

The integral therefore vanishes because of the orthogonality of the 
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wave functions, so that x — 0. This was, of course, to be expected, 
since the squares of the functions are symmetrical about the origin. 
For the average value of x 2 , we have 

i \/b/t r x 

x 2 = ^ J H n m 2 H n da 5-52 


From the recursion formula, we have 
% 2 H n — n£H n - 1 + ^H n+ 1 

= n[(n - 1 )Hn -2 + hlln] + |[(n + 1)11 n + \lI n+2 ] 5-53 


Since only the terms in H n contribute, the integral 5-52 becomes 

? ■ & ( ° + 4 ) X„ ^ 


— E 

Using the relations derived above, this may be written as x 2 = — or 

tc 


E = kx 2 . Now, classically, x 2 = , so that in terms of the mean 


square displacement from the equilibrium position we see that the 
energy is given by the same relation in both cases. 



CHAPTER VI 

THE HYDROGEN ATOM 


6a. The Hydrogen Atom. In this chapter we will treat by the 
methods of quantum mechanics the simplest atomic system, the hy- 
drogen atom. The hydrogen atom consists of a proton, of charge -J-e 
and mass M, and an electron, of charge — e and mass m. These two 
particles attract each other according to the Coulomb law of electro- 
static interaction. If we denote the coordinates of the proton by aq, 
yi, and Z\, and the coordinates of the electron by x 2 , y 2 , and z 2 , the 
potential energy is 


V (Xi - x 2 ) 2 + (z/, - y 2 ) 2 + (21 - Z 2 ) 2 


The classical Hamiltonian function, in this coordinate system, will 
then be 


H 


Mt/dxiV 

2 L \dt) 


+ 


(dyiY 

\ dt ) 


+ ( d ± 

\di 


+ • 


m 


;)] 


+ 


( dz 2 

Tt 


)1 


+ V 6-1 


Because of the form of the potential energy, the Hamiltonian function 
is not at all simple in this coordinate system. We therefore introduce 
the following change of variables. Let x, y , and z be the coordinates 
of the center of gravity. Referred to the center of gravity of the 
system as origin of a spherical coordinate system, let the spherical 
coordinates of the electron be a, 6 , (p and those of the proton 6, 0, <p, 

where a = — r,b = - — r and where r is the distance between 

M + m M + m 

the proton and the electron. In this coordinate system, the potential 

energy now has the simple form V . In terms of the new coor- 

r 

dinates, the rectangular coordinates of the proton and electron are 


M ii 

xi ss % — — r sm 6 cos (p = x H r sin 6 cos <p 

M m 

yi - y T} r sin 6 sin <P V 2 - y + — r sin 6 sin <p 

M m 


6-2 


Z\ = z — — r cos i 
M 


Z 2 = z -} '/cos 6 

m 
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where /x = — is the “ reduced mass ” of the system. With the 

M + m 

introduction of this change of variables, the classical Hamiltonian 
function becomes 


H 


M + m 


r ( *y + ( — v 

L \dt) + \dtj 

4(!) 2 


+ 


( 1)1 

2 f de \ 2 2*2/1 /*>Y] 

\dtj \dtj J 


6*3 


The first term is the kinetic energy, in rectangular coordinates, of a 
particle of mass M + m; the second term is the kinetic energy, in 
spherical coordinates, of a particle of mass /*. The Hamiltonian oper- 
ator for the system is therefore 



so that Schrodinger’s equation for the hydrogen atom is 


8 t 2 (M + m) 


+ + 


+ 


dx 2 1 dy 
h 2 


dz 4 




Sir 




+ 


r 2 sin 2 d dtp' 


i' + ( E ' + ^ i' = 0 6-5 


It is apparent that we can separate this wave equation into two equar 
tions, one containing x, y, and z only, the other containing r, 6, <p only. 
Carrying out this separation in the manner employed in the previous 
chapter, we set = x(x, y, z)^{r, 6, <p) and obtain the two equations 


d 2 X . d 2 X . d 2 x 8ir 2 (M + m) 
dx 2 + dy 2 + dz 2 + h 2 


(E' -E) X = 0 


6-6 


i a / , aA 

r 2 dr\ dr) 


+ 


1 d ( . aA 

r 2 sin 6 dd \ m dd) 


+ 


1 aV 8 tt 2 m 

r 2 sin 5 a/ h 2 



* 


0 
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Equation 6*7 contains only the relative coordinates of the two particles, 
so that E is the internal energy of the hydrogen atom. Equation 6*6 
is just the wave equation for a free particle of mass M + m, with 
translational energy E f — E. This equation has already been dis- 
cussed, and we will not consider it further in this chapter. It is ap- 
parent that in the treatment of any atomic or molecular problem the 
translational degrees of freedom of the system may be separated from 
the internal degrees of freedom in the same manner and thus need not 
be considered in general. 

To separate the variables in 6-7, we make the substitution 

<p) = R{r)Y{B t <p) 

obtaining 


I 

R dr \ dr ) 



l 

Y sin 6 86 



1 d 2 Y 
Y sin 2 0 8<p 2 


6-8 


By our usual argument, both sides of this equation must be equal to a 
constant, which we call X. We then have the two equations 


1 8 ( . 8Y\ 

sin 6 86 \ 86 ) 



6-9 



610 


Equation 6-9 is by now quite familiar. The allowed solutions are 
Y = Yi,± m {6, <p ), where X = 1(1 + 1), with l and m integers, and 
! > |m|. It should be mentioned at this point that, in all problems in 
which, in spherical coordinates, the potential energy can be written 
is a function of r only, the separation of the wave equation will proceed 
in the same manper as above, giving 7;, ±m as the angular part of the 
wave function. Introducing the required value X = 1(1 + 1) into 
3-10, and expanding the first term, we have 


2 dft 
dr 2 r dr 



i(i + in 

r 2 . 


R = 0 


611 


We must now consider separately the two cases where E is positive 
md where E is negative. With negative E, equation 6-11 may be 
simplified by the introduction of a new parameter n, defined by the 
relation 


2w 2 p.e 


612 
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and a new variable x defined by 

r = 


nh 2 


Equation 6*11 is reduced by these substitutions to 

dPR 2 dR ( 1 n_ 1(1 + 1) ' 

dx 2 x dx \ 4 + x x 2 


R = 0 


If we look for a solution of the form 

X 

R = u(x)x l e 2 


we find that u(x) must satisfy the differential equation 

d 2 u dii, 

x — 2 + (21 + 2 — x) - — (- (n — l — 1 )u — 0 
dx * dx 


6- 13 


644 


6 15 

616 


This is the same as equation 4429 if we put /? = 21 + 1 and a = n + l. 
We have seen that equation 4429 possesses satisfactory solutions only 
if a — (3 is a positive integer. Now a — is equal to n — l — 1, and, 
since l may have the values 0, 1, 2, 3, • • *, n may have the values 
1, 2, 3, * • • , with the restriction that n > l + 1. This gives the 
allowed negative values of the energy 


E = - 


2ir 2 fxe 4 

H 2 h 2 


n = 1, 2, 3, • • • 


647 


which are identical with the values obtained in Chapter I by means of 
the Bohr theory. 

The radial wave functions for the hydrogen atom are therefore 


R(r) = cx l e 2 Lf$(x) m t x = r 


nil 2 


To determine c, we take 


r z dr — 1 


But 


j <»00 

[ ft ( r )] 2 

0 

f [R{r)] 2 r 2 dr — c 2 f x 21 c~ x [Lf^ {x)fr 2 dr 

t/n *s n 


648 


619 

6*20 


We may write r 


nao . hr 

— x, where a 0 = —5 — 2 
2 4ttV 2 


is the radius of the first 


Bohr orbit, as is easily verified from the equations in Chapter I. Then, 
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by means of equation 4- 125, 
f [fi(r)]V dr = c 2 

«/n 


(t)X 


dx 


2 / wflp\ 8 2n[(n + Q!] 3 
C V 2 ) (n — l — 1)! 


so that 


R(r) ~ \ l(m J 2n[(n + i) !] 3 (na 0 ) * ^ ^ (na 0 ) 


6*21 


6*22 


where the minus sign has been introduced to make the functions 
positive. 

6b. Jlydrogenlike Atoms. The problem of the ionized atoms He + , 
Li’ H ", etc., is identical in principle with that of the hydrogen atom, the 
only distinction being a slight difference in the reduced mass p, and 
a numerical factor Ze 2 in place of e 2 in the potential energy. We can 
therefore write down immediately the solution for the general hydrogen- 
like atom of nuclear charge Z and nuclear mass M z . The solutions 
are 


t = R n ,i(r)Y lt±m (0, <p) 


121 +1(1- 

\m\)\ 

1 4x (1 + 

|m|)! 


6-23 


6*24 


l(n — Z — 1) ! /2Z\ s /2Zr\ l 
Rn ' l{T) ~ \2n[(n + 0!] 3 U/U/ 


e no ° Ln+i \ 


( 2Zr\ 
\nao/ 

kb-l-Vl (Z\« /2gV ^ /2g\ 

\ n 4 [(n + Z)!] 3 \a 0 ) \n) 1 \n) 


6-25 


where p = — r. Certain of the normalized radial wave functions for 
do 

hydrogenlike atoms are given in Table 61. The energy levels for these 
atoms are 


2.4 


E z = 


— 2ir 2 n z Z 2 e 

vW 


r7 2 JP . 

mh 


MZ : 


Mtfn 

M z + m 


6*26 


Mz 

Since — is very close to unity, the energy levels of hydrogenlike atoms 
mh 

are given to an excellent approximation by the relation E z = Z 2 E B , 
where E s represents the energy levels of hydrogen. 
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The energy levels as given by 6-26 are in essentially perfect agree- 
ment with the experimental results. The above treatment of the 
hydrogen atom neglects certain small energy terms, the first being 
that due to “ electron spin,” which, although very small for hydrogen, 
is important in the general theory of atomic structure and will be con- 
sidered in a later chapter, the second being a small relativity correction 
which we shall entirely disregard, as these relativistic effects are neg- 
ligible for small energies and hence can be ignored in problems of chemi- 
cal interest. 

6c. Some Properties of the Wave Functions of Hydrogen. From 
the form of the wave functions, we see that they sati 3 fy simultaneously 
the equations 

H \l/ n , l, ±m = E n 

= 1(1 + 6-27 


M, \[/ n> it ±m = Z, ±m 


so that the energy, square of the total angular momentum, and z com- 
ponent of the total angular momentum are simultaneous eigenvalues. 
Each wave function of hydrogen is specified by the three quantum 


TABLE 61 

Normalized Radial Wave Functions R ( r ) for Hydrogenlike Atoms 


p = — r; n ^ l + 1 
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1*8, 0 — /-l 

81V 3 \°o. 
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numbers n, Z, and m. The quantum number n determines the energy 
of the atom; the quantum number l determines the total angular 
momentum; and the quantum number m determines the z component 
of the angular momentum. In Figure 6T we have plotted the radial 
wave function R(r) for several of the lowest energy levels of hydrogen. 
It is noted that the radial wave function has a non-zero value at r = 0 
only for those states for which Z = 0, that is, only for those states which 
have no angular momentum. The radial wave functions become 
zero n — Z — 1 times between r = 0 and r = <x > . 

According to Postulate I, the probability of finding the electron at 
a distance between r and r + dr from the nucleus, with its angular 
coordinates having values between 9 and 9 + dd, <p and <p + d<p, is 

W dr = [R n , i(r)] 2 [Y it ±TO (0, <p)] 2 r 2 sin 9 dr dd dip 6*28 


An alternative way of viewing this situation is to consider the electron 
as having a spatial distribution, the density of the “ electron cloud ” 
at any point in space being given by the square of the wave function 
at that point. The angular distribution of the electron density is given 
by the square of the spherical harmonics Yi t±m (9, <p). Referring to 
Table 44, we see that the state with l = 0, m = 0, is spherically sym- 
metrical about the origin. The state with l = 1, m = 0, has the maxi- 
mum value for the electron density along the z axis; the states with 
/ = l f rn = 1, have the maximum value for the electron density in the 
xy plane. 

To determine the probability that the electron be between the dis- 
tances r and r + dr , regardless of angle, we must integrate over 6 and 
<p. Since the spherical harmonics are normalized to unity, this inte- 
gration gives us simply 

P(r) dr = [R nt i(r)] 2 r 2 dr 6*29, 

In Figure 6*2 we have plotted this probability distribution for several 
of the lower states of the hydrogen atom. The average distance of 
the electron from the nucleus is given by the integral 

f = J \l/*r\{/dr =5 J [72 n , j(r)] 2 r 3 cZr 6*30 


For the hydrogen atom in the lowest state (n = 1, l = 0), the average 
value of r is 


f 


4f 

Oq Jq 


2 r 



6*31 
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that is, the average distance of the electron from the nucleus is three- 
halves the radius of the first Bohr orbit. The most probable value of r 
is found from the equation 

dP(r ) 2 

— = e a ° r 2 + 2re a ° = 0 6-32 

dr a 0 

so that the most probable distance of the electron from the nucleus is 
exactly ao. The average value of r 2 is 

r 2 = J f e “°r 4 dr = ^ 4! = 3a§ 6-33 

# 0*^0 &Q \* / 



States of the hydrogen atom with l = 0,1, 2, 3, 4, 5, • • • are known 
as s, p , c?,/, g, h, • • • states, respectively. A state with n = 3, l = 1, is 
called a 3 p state; a state with n = 1, l = 0, is called a Is state; etc. 
It will be noted that there are three p states, with m = 1, 0, — 1. These 
may be designated as 1 , p 0 , and p__i states, respectively. The angu- 
lar factors associated with these states are, aside from a numerical 
factor, 

p+i ~ sin 6e i,p 

Po ~ cos 6 6*34 

p_i ^ sin 
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For many purposes it is more convenient to replace these functions 
by the following linear combinations: 

p+i + P-i 

p x 7 = ~ sm 0 cos <p ~ x 

V2 


Pt = Po ~ cos 0 ~ z 
. P+i ““ P-i 


6*36 


Vv 


Vi 


~ sin 0 sin ip ~ y 


the designations p Xl p Vy and p z indicating that the angular part of these 
wave functions have their maximum values in the x y y } and z directions 
respectively. Similarly, for the d functions, we take the linear combina- 
tions 

d z t = do ~ (3 cos 2 0 — 1) ~ 3z 2 — 1 
d + 1 + d_i . 

d XM = 7= — ~ sin 0 cos 0 cos <p ~ xz 


Vi 

djL.\ — d i 

d yg = — i 7= ~ sin 0 cos 0 sin ^ i/z 


dx%—yt — 


V2 

d-f-2 + d—2 

V 2 


6-36 


1 sin 2 0 cos 2(p ~ sin 2 0(cos 2 <p — sin 2 <p) ~ x 2 — y* 




— % 


d-f 2 — d _2 

V2 


- sin 2 0 sin 2<p ~ sin 2 0 cos <p sin <p ~ xy 


In Table 6*2 we give the complete normalized hydrogenlike wave func- 
tions for n = 1, 2, 3, using the above form for the angular part of the 
wave functions. 


TABLE 6*2 

Normalized Hydrogenlike Wave Functions 
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TABLE 6-2 ( Continued ) 
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6d. The Continuous Spectrum of Hydrogen. We must now con- 
sider those states of the hydrogen atoms for which E is positive. In 
this case we make the substitutions 

2irV 4 


E = 


r = 


Equation 641 then becomes 
dPR + 2dR 
dx 2 x dx 


k 2 h 2 

kh 2 

8:r V 2 ‘ 




6-37 

6-38 

6-39 


For very large values of x this is approximately 

d 2 R 
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for which the solutions are R = ce ±% *. The solutions of 6*39 are there- 
fore finite at infinity. The only possible difficulty is at the origin, 
where there is a singular point. If we let 

R = a 0 x L + aix L+l + a 2 x L+2 + • • • 641 

the indicial equation is found to be 

L(L — 1) + 2L — l (l + 1) = 0 642 

so that L = l or L = — (Z + 1). The solution beginning with L * l 
is finite at the origin and at infinity. There is therefore a solution of 
Schrodinger’s equation of class Q for all positive values of E. We thus 
have a continuous range of positive eigenvalues, corresponding to 
ionization of the hydrogen atom, as well as the discrete set of negative 
eigenvalues for the un-ionized atom. 



CHAPTER VII 


APPROXIMATE METHODS 

7a v perturbation Theory. The number of problems which can 
be solved exactly by the methods of quantum mechanics is not very 
large. This is not surprising if we recall that even in classical mechanics 
such problems as the three-body problem have resisted solution in a 
closed form. The great majority of the problems of quantum mechanics, 
including the problem of the structure of all atomic systems more com- 
plicated than the hydrogen atom, must therefore be treated by ap- 
proximate methods. The most important of these approximate 
methods, at least for our purposes, is the quantum-mechanical per- 
turbation theory. 

Suppose that we wish to solve the problem of the motion of a system 
whose Hamiltonian operator H is only slightly different from the 
Hamiltonian operator H 0 of some problem which has already been 
solved. Associated with H 0 we have a set of eigenvalues E% • • • 
En * * •, and the corresponding eigenfunctions ^?, $ • * • tZ * * * satis- 
fying the equation 

H 0 ^ = « 7-1 

Since by assumption H is only slightly different from H 0 , we write 

H = Ho + XH (1) 7-2 

where X is some parameter, and the term XH (1) , which is called a “ per- 
turbation,” is small in comparison to H 0 . The equation which we wish 
to solve is therefore 

(Ho + XH (1) )*„ = £„*„ 7-3 

If X is placed equal to zero, equation 7-3 reduces to 7-1, so that it is 
natural to assume that for small values of X the solutions of 7-3 will lie 
close to those of 7*1 ; that is, the effect of the perturbation XH (1) will 
be to change slightly the “ unperturbed ” eigenvalues E% and eigen- 
functions Now suppose that and E n are the eigenfunction and 
eigenvalue which approach and as X — ► 0; and for the present 
we shall assume that no two of the Ujj’s are equal. Since and E n 
will be functions of X, we may expand them in the form of a power 
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series as 

+n « *2 + A*«> + \Vf + • • • 7-4 

Bn = fiJJ + Ai’n 1 ’ + X 2 E® + • • • 7-5 

where \pn\ • • • ; E™, E„\ • - • are independent of A. If we sub- 
stitute these series in 7*3 we find 

Ho^ + X(H (1 Vn + Ho^ x) ) + A 2 (H (1 Vn ) + H 0 *f) + ••• 

= E°„i° n + A (E™i° n + E° n W) 

+ A 2 (E*V n + EjPtiP + EW») + • • • 7-6 

In order that this equation may be satisfied for all values of X the co- 
efficients of the various powers of X on the two sides of the equation 
must be equal. Equating the coefficients of the various powers of X 
gives the series of equations 

H 0 f° = E°J° n ' 7-7 

(H 0 - El)*™ = E ( M - H (1 V n 7-8 

(H 0 - El)^ = E®t° n + E?Y» - H (1 Vn > 7-9 

The first of these, by assumption, is already solved. If the second 
can be solved, we can find i/4 x) and E„K The solution of the third 
equation then gives and E®, and so on. 

In order to solve equation 7-8, let us assume that the expansion of 
the function 'Pn ) in terms of the normalized and orthogonal set of 
functions 'Pi, P 2 , ’ ' ' Pn , • ' ' is 

^ = ri 1 iA? + ri 2 ^ + --' + ri m ^ + .-. 7-10 

where the A m ’B are to be determined. The function H (1 ty° can also 
be expanded into the series 

H (1) ^ -Hjgfl + + • • • + + • • • 7-11 

where 

/^H W *°ndT \ 

Substituting these series into 7-8 gives 
(Ho - K) Ml + + •••) = B2W - - Hm • • • 7-12 

which can be reduced by means of equation 7*7 to 
C®? — E%)Ai4% + (Bs — E^)A 2 p a 4- • • • . 

= Bg>ti - um - ff&Vg ... 7-13 
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The coefficient of each \p° must be equal on both sides of the equation. 
On the left the coefficient of \p° is zero; on the right it is (25® — 22® ) ; 
so that 

E2> - = 0 7-14 

The first-order perturbation energy has thus been determined to be 

= 27< 1 n ) = fi° n *U w £dr 7-15 

By equating the coefficients of ^ (m ^ n), we obtain 

H a) 

(E° m - E° n )A m = -22®; A m = p - ^ p o 7-16 

This relation gives us the values of all the A’ s except A n . The coeffi- 
cient A n may be determined by the requirement that be normalized. 
We may express \j/ n as 

' in = i°n + X Z'A m f° n + \A n *° n + X 2 (- • •) 

m 

where means that we are to sum over all values of m except n. Then 

m 

f dr = dr + \Z'A m f vC* ^ dr 


+ \E'A m f M dr + 2 \A n J dr + X 2 (- • •) 7-17 


Since the functions ip° m are normalized and orthogonal, equation 7*17 
reduces to 

f dr = 1 + 2XA„ + X 2 (- • •) 7-18 


If the functions \j/ n are to be normalized, the right side of this equation 
must be equal to unity for all values of X, so that we must put A n 
equal to zero. The results to the first order in X are therefore 

E n = E° n + Xff® + x 2 (- • •) 7-19 

o-U) 

- *n + XL' p0 m V + X 2 (- • •) 7-20 

m F J n &in 

We obtain and 25® by a similar process. If we assume that 
PT - Blip i + B 2 ip2 + • • * + BmPm + * ' • 


7-21 
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where the B * s are to be determined, then 

(H 0 - E° n )t™ * (B? - EDB^ + (E 2 - + 

From equations 7*10, 7*11, 7*15, and 7*16, we have 

//d )//(!) 

m 

H “ v?,= 5W^k H " v ” 

m ti n A 

rj(l)rja) 

_ ^ / LJ km LL mn ,0 

“ 2^2-, wj 

A m ti n — ti m 

so that, with the use of these results, equation 7*9 becomes 

E (®2 - + E' vC 

A m tin — ti m 


722 


723 


7*24 


r/(l)z/(l) 

V'^T* / km 1A mn ,o 7 f>K 

2-*2l, pO tttO ^A 
Am ti n — 


If we now equate the coefficients of ^ on either side of the equation, 
we find 

TriDrid) 

0 - E™ 7-26 

m x!/rt ti m 

or 


tt(1)it(1) 
p(2) v - ' / xi nm xi mn 


^ = E' 


tttO t;tO 

tin ti ~n 


\ 


Equating coefficients of (/c ^ n) gives 


rr(l) r/fl) rr(l)rr(l) 

/ 77*0 27*0\ r> xx nn xx An t xx km 11 mn 

\ti k — ti n ) B k - -5 ~o - 2- 


so that 


B* = E' 


< - £2 


EtO 7tt0 

ti n — ti m 

Tl(l) Zj(l) 

xx rm xx An 

(E° n ~ E° m )(K~ E° k ) (K~E ° k ) 2 


7-27 


7-28 


7-29 


The normalization of requires that vanish. The results, correct 
to the second order in X, for the energy levels and wave functions are 
therefore 


tjWtt(I) 

E n = 2ff + XffW + x 2 E / - pd - ? -- lg + x 3 (- • 0 + 

m tin xS„, 


7-30 
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4>n 


*2 + xr 

m 



+ x 2 



(K-E° k )(E° n -E° m ) 


1 

(E° n -E° k ) 2 J 


** + x 3 (- • •) + • • • 

7-31 


In most applications it proves convenient to absorb the parameter X 
into the function H (1) , in other words, to place X equal to unity in the 
above equations. * W 

7b. Perturbation Theory for Degenerate Systems. In the last 
section we made the assumption that the unperturbed energy levels 
En were all different. Problems frequently arise in which two or more 
orthogonal eigenfunctions have the same eigenvalue. Such eigen- 
values are called degenerate. Suppose, for example, that 


and 


Hi/'! = E\p\ 

7-32 

H^ 2 = Eyp 2 

7-33 


Then, if Cj and c 2 are any constants such that c*ci + c 2 c 2 = 1, 


+ ^ 2 ) ~ + £ 2 ^ 2 ) 7-34 


so that Ci^i + c 2 \[/ 2 is also an eigenfunction. Hence, if there are two 
eigenfunctions having the same eigenvalue, there are an infinite num- 
ber of eigenfunctions having that eigenvalue. 

A set of n eigenfunctions is said to be linearly independent if there 
is no relation of the type 

cvPi + c 2 \p 2 + • * * + c n yl/ n = 0 7*35 

connecting them. In the above example there are no three eigen- 
functions which are linearly independent since all the eigenfunctions 
are expressible in the form Ci^i + c 2 ^ 2 - If the number of linearly 
independent eigenfunctions corresponding to a given eigenvalue is n, 
the eigenvalue is said to be n-fold degenerate. Once a set of n linearly 
independent eigenfunctions has been chosen, any other eigenfunction 
with the given eigenvalue can be expressed in the farm 

f'l'Pl + Cl'P'i + • • • + C'n'Pn 7*36 

where the c’s are constants. 

Now suppose that we wish to find the solution of 

(H 0 + XH (1) ty - Etp 


7-37 
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where the eigenvalue approaches an m-fold degenerate eigenvalue of 

H 0 ^° - EY 7 *38 

as X approaches zero. With no loss of generality we may assume that 
the m linearly independent eigenfunctions corresponding to this eigen- 
value are $ • • • i° my so that B? - £§=... = E® m . We may also 
assume that have been made orthogonal. As X approaches 

zero, ^ must approach some solution of 7*38 whose eigenvalue equals 
Bj, that is, some linear combination 

<P = Cl*? + c 2 *? + • • • + c m *° 7-39 

where the c’s are constants. This linear combination may be spoken 
of as the “ zeroth-order ” approximation to *. The expansion of E 
and * in powers of X must therefore be of the form 

E = £? + XE m + \ 2 E i2) 740 


* = E ctfj + x* (1) + xv (2) + 

J'-X 


741 


Substituting these expansions in 7-37 and equating coefficients of like 
powers of X gives 


H 0 E ctf =E° 1 E erf 

i =1 i=i 

m 

(Ho - £?)* (1) = E Cj{E w - H (1) )*y 

/-I 


742 


743 


Since £? =£?=••• = equation 742 is already satisfied. As 
before, let us put 

* (1) = E A^f, H (1) *y = E HjyV? 744 

i k 

where H$ = J' *?*H (1) *y dr, and the A's are constants to be deter- 
mined. Then 

£ cyH<i>*9 = £ E 

y-i i-ik 


= E E cM'ti 

i-1 

Substituting these expressions in 743 gives 

E(£? - s?)A^ = E - E(E c*Hj|>)^ 

i-i i *-i 


745 


746 
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If j > m, equating the coefficients of 4$ on both sides gives 

(Ej - E'l)A i = E m c, - £ Hj? c k 7-47 

k - 1 

But, for j < m, E J* = so that 

m 

EMjkC/c - £ (1) c y = 0 7-48 

fc-i 

We thus have the system of m simultaneous equations for the c/s 

(//IV - E (1) )a + llnc-i + • • • + U\"f m - 0 

IlXtcy + (H®-E w )r, + •••+ U®c m = 0 


//<&, + H®p . 2 + • • • + (//”! - /? a, )c m = 0 7-49 

A possible solution of this set of equations is c* = c 2 = • • • = c w = 0. 
According to a theorem of algebra (Appendix IV) this is the only solu- 
tion unless the determinant of the coefficients of the c’s vanishes, that 
is, unless 

(//IV - E m ) II® ••• II® 

II® (11® -E w )--- H® 

■ = 0 7-50 

mil hui ••• (hz-e w ) 

This is known as the secular equation. Since the Z/jjV’s are known 
constants, it is an equation of the mth degree in E (1 \ and therefore 
has m roots. Let these roots be E®, E®, • • • E®. Unless some of 
these roots happen to be equal, there are therefore m different perturbed 
states whose energies approach E J as X approaches zero. 

In order to find the eigenfunction corresponding to the root El l \ 
we substitute this value for £ (1) in the set of equations 7*49 and solve 

for the ratios A knowledge of these ratios, plus the nor- 

C\ Ci 

malizing condition 

c* c i + c|c 2 + * * * + c*c™ = 1 7*51 

is sufficient to determine completely the c’s and hence the zeroth-order 
eigenfunctions <pi , the subscript indicating the root of 7*50 to which the 
eigenfunction corresponds. 
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The first-order eigenfunction may now be found by equating the 
coefficients of the remaining $ (j > m) in equation 746. The result is 

% 

(£? - EftAi = - E ctHjP; At = fg 7-52 

In order that \p be normalized we must put Aj = 0 0* < m). Hence, 
if is the eigenfunction whose zeroth-order approximation is tpi , the 
first-order perturbation theory gives 

m 

E c k II$ 

ii - <pi + X E 775 ToT fy + • •) 7*53 

]>m ~ Uj) 

E t = E°i + XEP + X 2 (- • •) 7-54 

for the perturbed eigenfunctions and eigenvalues. 

7c. The Variation Method. Another, completely different, method 
of finding approximate solutions of the wave equation is based upon 
the following theorem: If <p is any function of class Q such that 

JVv dr = 1, and if the lowest eigenvalue of the operator H is E$ f then 

J <p*H*> dr > E 0 7-55 


The proof of this theorem is very simple. Consider the integral 



Eq)<p dr ~ 



7-5G 


If we expand the function <p in a series of the eigenfunctions fa, • * * 
ypi • • • of H, we will have 


/ >C H - Eoh dr = fiZcffi) (H - £'o) (£*#<) dr 7-57 


Since the fa ’ s are eigenfunctions of H, H^,- = Etfi, and so equation 
7-57 becomes 



— E 0 )<p dr = 


f (EcJtf 

i 


)(E (Ei - EMi) dr 


= E ctei(Ei - E 0 ) 

i 


7-58 
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Now c*Cj is a positive number, and by definition Ei > E 0 , hence 


and 


f W H - E 0 ) V dr>0 

7-59 

J' <p*Hip dr > E 0 

7-60 


The equality sign can hold only when <p = f 0 , where is the eigen- 
function with the eigenvalue E 0 . 

The method of applying this theorem is equally simple in principle. 
A trial eigenfunction <p(\ 1 , X 2 , • • • ), normalized to unity, is chosen, this 
trial eigenfunction being a function of a number of parameters Xi, 

X 2 , • • • . The integral J — J dr is then found. The result will, 

of course, be a function of the parameters Xj, X 2 , • • • . The integral J 
is then minimized with respect to the parameters. The result is an 
approximation to the lowest eigenvalue, and the corresponding <p is 
an approximation to the corresponding eigenfunction. By taking a 
sufficiently large number of parameters in a function of a well-chosen 
form, a very close approximation to the correct eigenvalue and eigen- 
function can be found. 

In simple cases, and with a judiciously chosen trial eigenfunction, the 
results of the variation method are identical with the results obtained 
by the solution of the Schrodinger equation. As an example, let us 
consider the harmonic oscillator, for which the Hamiltonian operator is 


h 2 d 2 k 2 

8ir 2 m dx 2 2 X 


7-61 


If we try <p = ce x **, the condition that <p be normalized is fulfilled if 
4/2X ■ 

we put c = * / — . Then 


V T 


H* = — A- (4xV - 2X)e- x *’ + ^ **«-*•* 

8irm 2 

and 

J = (4X 2 * 2 - 2X)e~ 2Xx> + ~ x 2 e- 2X * , l dx 7 62 

L or m 2 J 

Upon evaluating these integrals (Appendix VIII), we obtain 

h 2 \ . k 
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The condition that J be a minimum is 

dj _ _ _fc_ _ 

d\ ~ 8t r 2 m 8X 2 “ 



7-64 


so that the lowest eigenvalue is 


E, 


47 r \ m 


and the corresponding eigenfunction is 


<p = 



7*65 


7*66 


The fact that these results are identical with those of section 5e is, of 
course, due to the “ well-chosen ” form which we took for our trial 
eigenfunction. 

It is also possible to use the variation method for the calculation of 
energy levels and eigenfunctions for excited states. After we have 
obtained an approximation to the ground state by the above method, 
we choose a second trial eigenfunction which is orthogonal to the one 
that we obtained for the ground state. A repetition of the above pro- 
cedure will then give us an approximation to the first excited state. 
This process may be continued indefinitely, although the errors involved 
will be cumulative. 

7d. The Ground State of the Helium Atom. As a further example 
of the use of these approximate methods we shall calculate the energy 
of the ground state of the helium atom both by the application of the 
first-order perturbation theory and by the variation method. The 
Hamiltonian operator for the helium atom (or for other two-electron 
atoms such as Li + , etc.), is, if we neglect the terms arising from the 
motion of the nucleus, 


H = 


h 2 

8ir 2 m 


(V? + V 2 2 ) 


Ze 2 _ Ze 2 + 6^ 
r\ r 2 r l2 


7*67 


where Vi and V 2 are the Laplacian operators for electrons 1 and 2; 
ri and r 2 are the distances of these electrons from the nucleus; r 12 is 
the distance between the two electrons; and Z is the nuclear charge. 
For the purpose of simplifying the calculation of the integrals involved 
in problems of this type it is usually more convenient to use atomic 
units. The transformation to atomic units is obtained by expressing 


distances in terms of the Bohr radius clq = 


h 2 

4i r 2 me 2 * 


We therefore 
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^2 j ^2 

write ri = a 0 /2i, r 2 = ao# 2 , r 12 = a 0 Ri 2 , ^ ^ ^ > etc- With 

this transformation, the Hamiltonian operator becomes 

ft 2 1,,. Ze 2 -> 2 


H = - 


87r 2 m a 2 


(v? + v|) 


Ze 2 e* 


a 0 ^i QqR 2 (hRi2 


7-68 


or, in units of — > 

do 

H 


1,2 2x -Z ^ , 1 

-i(v? + v|) - — - — + — 

ill K>2 **12 


7-69 


In applying the methods of the perturbation theory to this problem 
we set H = H 0 + H (1) , where 

Z Z 


Ho= -4(vf +V|) - 


Ri R 2 


7-70 


H (1) = 7-71 

R 12 

The zeroth-order eigenfunctions are the solutions of the equation 

Hot 0 = E°t° 7-72 

If we now set t° = ^°(1)^°(2), E° = E°( 1) + E°( 2), equation 7-72 is 
immediately separable into the two equations 

W(i) + ( e °(1) + Jr)*°(i) = o 

7-73 

iv|*°( 2 ) + ^°( 2 ) + J-)m = 0 


These equations are just those for a hydrogenlike atom with nuclear 
charge Z . For the ground state of the helium atom, we therefore have 

*°(1) = _L z SA e~ ZRl t°(2) = 4= Z n e~ ZRl 

VJ r v 7T 

= ^°(1)^°(2) = — e - Z(fil+K!) 7-74 

7 r 

£° = #°(1) + £°(2) = 2Z 2 E’ la (H) 

where 2?i,(H) is the energy of the ground state of hydrogen, and, in 
e 2 

ordinary units, is — & — . The first-order correction to the energy is, 

Oq 
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according to equation 7*15 

E w = f f *°*H< V dn dr 2 

e 2 Z Q C n q~ 2ZR ^q~ 22R ^ 

-*„ 7 ‘J J g„ dr ' dn 

where 

dri = R 2 sin 0i dRi dO x d$i 
dr 2 — R 2 sin 02 dR 2 d& 2 d$ 2 


7*75 


Written in the above form, the integral cannot be evaluated because 

of the presence of the term —• . According to the theorem proved in 

R\2 

Appendix V, this quantity can be expanded in terms of the associated 
Legendre polynomials as 

k - ? 5 Utm! w p! " ( “ s 0,) p| " <cos 0j) ™ 


where is the smaller and R > is the larger of the quantities R\ and 
R 2 . The wave functions themselves do not involve the angles ex- 
plicitly; in other words, only the constant functions Po (cos 0i) and 
P° 0 (cos 0 2 ) are involved in the wave functions. Since the associated 
Legendre polynomials are orthogonal, all the terms in the summation 
will vanish except those for l = 0, m = 0. For these terms P 0 (cos 6) 
= 1, so that equation 7*75 reduces to 


E (1) -= 


a 0 7 r 2 



e -2ZRi e -2ZR2 

P~> 


dri dr 2 


7*77 


The integration over the angles gives a factor (47 r) 2 , so that we have 
only the integral over Ri and R 2 , which may be written as 

E m = 16 Z 6 — f e~ 2ZRl I" y- f ' e~ 2ZRi R 2 dR 2 

LRi Jo 

+ f e~ 2ZRi R 2 dR 2 

J Ri 

which may be evaluated in a straightforward manner to give 


R\dR x 


7-78 


E m = | z — 

8 Oo 


7-79 


To the first order, the energy of the lowest state of helium (or helium- 
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like atoms) is therefore 

E - E° + E w = (2 Z 2 - \Z)(- \ — ) 

\ 2 oo/ 7>80 

= (2 Z 2 - \Z)E U QL) 

The energy of the ground state of He + is Z 2 E u (Jl). The first ioni- 
zation potential of helium, that is, the energy necessary to remove one 
electron from the atom, is thus calculated to be 

( Z 2 — %Z)E U (H) = = f (13.60) = 20.40 electron volts 

The observed value is 24.58 e.v., so that our calculated value is in error 
by 4.18 e.v. or about 16 per cent. Our results will look better if we 
compare the calculated and observed total binding energy. These 
values are: calculated -^(13.60) = 74.80 e.v., observed 78.98 e.v. — 
an error of 4.18 e.v. or about 5 per cent. The actual error is the same 
in both cases; the percentage error is, of course, decreased in the 
latter case./ 

For the two-electron atoms Li + , etc., the percentage error is consid- 
erably less than for helium, since the interactions between the electrons 
and the nucleus become relatively more important than the interaction 
between the electrons. In 7*80 we see that for helium the perturba- 
tion energy is of the zeroth-order energy. As this is certainly not 
a “ small ” perturbation, we should not be greatly disappointed at the 
failure of the first-order perturbation theory to give more accurate 
results. We shall now show how the binding energy of the helium 
atom can be calculated to any desired degree of accuracy by means of 
the variation method. 

In order that we may choose a reasonable trial eigenfunction for use 
in the variation method, let us consider that one of the electrons of the 
helium atom is in an excited state and the other in the ground state. 
The electron in the ground state is subjected to the full attractive force 
of the nucleus, so that ^°(1) should be essentially the same as in 7*74. 
The electron in the excited state, however, moves essentially in the 
field of a nucleus of charge e, as the (Is) electron screens the nucleus 
more or less completely, and so ^°(2) for an excited state should ap- 
proximate more closely a hydrogen wave function with Z = 1. These 
considerations suggest that a good trial eigenfunction would be 

* = — e - z '^ +Ri) 7-81 

7 r 

where Z' is between 1 and 2, the best value being determined by min- 
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inuring the energy with respect to Z'. The functions written above 
are the solutions of the equation 

H Q<p = E'vp 7*82 

e 2 

where, in units of — , 
a ® 

E'o = 2Z' 2 E u (R) 

We must evaluate the integral 

E - //„* H, dri dr 2 7*83 


where H is given by 7*69. We can write H <p as 


H<p 


= [-J (V? + vf) - f; - fj]* - - Z ') ( 

= E^ - (Z - z') + -}-)<? + ±-<p 

\K 1 «2/ «12 


#1 


+ 






7-84 


so that the integral 7-83 is reduced to 
E = E' 0 - (Z - Z')[f J dr 2 + f f £dn dr 2 ] 

+ f dr 2 7-85 

*/ t/ 1^12 


The two integrals in the brackets are equal, as they differ only in the 
interchange of the subscripts. The first of these is 

16ir 2 ^4 f e~ 2Z Rl Ri dR x f e~ 2Z ' R 'R% dR 2 = Z' 7-86 
t 2 J o *^o 

The last integral is identical with 7-75 if Z is replaced by Z' . The 
result for the energy is thus 

e 2 e 2 

E = E' 0 - 2 Z'{Z -Z')- + fZ' - 
do do 

= [2Z' 2 + 4 Z\Z - Z') - %Z']E U ( H) 

= [-2Z' 2 + 4 ZZ' - $Z']E ls (H.) 7-87 

According to the theorem of the variation method, we obtain the best 
approximation to the true energy by giving Z' the value which will 
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'j*l 

make the energy a minimum. This condition requires that 
fiW 

— , = (-4Z' + 4Z-f)J5 la (H) = 0 
or 

Z' = Z - 7-88 

Substituting this result in 7*87, we obtain for the energy the value 

E = 2Z' 2 E l8 (R) = 2 (Z - -3r) 2 E u (R) 7*89 

For the first ionization potential of helium we therefore obtain 
[2(f£) 2 ~ 4]Z?is(H) = 1.695(13.60) = 23.05 e.v. 

The discrepancy between calculated and observed values is thus re- 
duced to 1.53 e.v., or about 6 per cent. The total binding energy is 
calculated to be 77.45 e.v. as compared to the experimental value 
78.98 e.v., or an error of about 2 per cent. 

By introducing more parameters into the trial eigenfunction <p, we 
can approach more and more closely to the experimental result. Par- 
ticularly good results are obtained, even with quite simple trial eigen- 
functions, if the variable Ri 2 is explicitly introduced into the trial 
eigenfunction. For example, Hylleras, 1 using the trial eigenfunction 

<p = A { e- z '( Rl+Ri) (1 + cR 12 ) } 7-90 

where A is the normalization factor and Z ' and c are adjustable con- 
stants, obtained a value for the ionization potential which was in error 
by only 0.34 e.v. By using the more general function 

<p = A { e - z '( R i+ R d (polynomial in R\ y R 2j R\ 2 ) } 7-91 

he was able to reduce the error still further. The function involving 
a polynomial of fourteen terms gave a value for the energy which agreed 
with the experimental value within 0.002 e.v. It is thus apparent that 
the accuracy of the results obtainable by the variation method are lim- 
ited only by the patience of the calculator, a very important limitation, 
however. In our later work, we shall therefore be satisfied with a smaller 
degree of accuracy in most of our numerical calculations. 

1 E. Hylleras, Z. Physik , 65, 209 (1930). 
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8a. Time-Dependent Perturbations. In the previous chapter wc 
discussed the problem of determining the new energy levels and wave 
functions for a system subjected to a perturbation which depended only 
on the space coordinates of the system. For certain problems, particu- 
larly those dealing with the emission and absorption of radiation, we 
need to calculate the effects produced by a perturbation which is a func- 
tion of the time; therefore, we shall now develop the time-dependent 
perturbation theory. 

The wave equation, in the form which expresses the manner in which 
the complete wave function \I> changes with time, is 


H* = - 


h d* 

2iri dt 



81 


where ft = — . Let us now write the Hamiltonian operator as 

H = H 0 + H', where H 0 is independent of time and H' is a time- 

dependent perturbation. The unperturbed eigenfunctions satisfy 

the equation 

d'I > 0 

H 0 *° = ih 8-2 

dt 

__ . En 

and are of the form \^(g, t) = \I>°(g)e % h • In order to obtain a solu- 
tion of 8*1 we expand the function \I> in terms of the unperturbed eigen- 
functions with coefficients which are functions of the time; that is, 
we write 

*(?, 0 = £*.(0*2(5, 0 8-3 


Substituting this expression for in equation 8*1 gives 


Ho*" + IXH'*” = iffc -2 *» + ihT.Cn ~ 

n n n Qt n (XI 


84 


Since the unperturbed eigenfunctions satisfy equation 8-2, the above 
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fih ♦ 


lation immediately reduces to 


n n CLt 


8-5 


Let us now multiply both sides of this equation by and integrate over 
coordinate space. Then 



In any particular problem we will thus have a set of simultaneous differ- 
ential equations which can be solved to give explicit expressions for the 
c»’s. 

8b. The Wave Equation for a System of Charged Particles under 
the Influence of an External Electric or Magnetic Field. The most 
important problem to which the time-dependent perturbation theory 
will be applied is that of radiation. In order to discuss radiation theory 
we need the Hamiltonian operator for a charged particle in an electro- 
magnetic field. In deriving the classical Hamiltonian function it is 
more convenient to use the vector potential A and the scalar potential (p 
rather than the electric and magnetic field strengths E and H. The 
relations between these quantities are given by the equations 


H = V*A; E = - - - A 
c dt 


V<p 


8*8 


where c is the velocity of light. 

A particle of mass m and charge e moving with a velocity v in an elec- 
tromagnetic field is subjected to a force 


F = e(E + - [v*H]) 
c 


The equations of motion are therefore 


m 


d 2 x 
dt 2 


d<p e dA x 
6 dx c dt 


e ( dy TT dZ Tj\ 


8-9 


d 2 y d 2 z 

with similar expressions for and Using the relation H = V*A, 


these equations become 
cPx dtp 


m = — e ■ 
dt 2 dx 


fdy/dAj, 

c dt ^ c _dt\ dx 


e dA x e 

+ 


dAx 

dy 

+ 


) 


dz 


(dA,_ 


dA, 


dt \ dx dz 


’)] 


8-10 



THE WAVE EQUATION 


109 


etc. It is not difficult to demonstrate that these equations of motion 
are derivable from the Lagrangian function 


mV(dx\ 2 ( dy\ 2 / cfe\ 2 ~l 

= aLU/ + W + WJ 


e / dx dv dz 

+ -ck-t A - + i A ' + di A - 


) 


etp 8*11 


dL 


From the definition of generalized momentum, pi = — , we see that 

d$i 

dx e 

== m — + ~ A x , with analogous values for p y and p z . The Hamil- 
dt o 

tonian function is therefore 

-![(!)’ +(f) !+ (!)’]+- 8 - 12 

or, in terms of coordinates and momenta, 

h - 1 [( p * - i A )‘ + (* - + ( * • - \ a ) ] + *- 13 

The procedure for constructing the Hamiltonian operator is identical 
with that followed before; in the classical Hamiltonian function the 

fi d d 

momentum p x is replaced by — ; — = —ih — , etc Operating on a 

27 n dx dx 

wave function the first term in the Hamiltonian gives 

t 2 


m \ dx c / 
2m \ 


fi 2 j~2 + ih - r- Ax + ** ‘ T~ + ~ 

dx C dx c dx c 


K| 2 )* 


1 l . * edA * T 

= — ( -fr — o + ih- — ^ 
2 m\ dx 2 


C dx 

*4“ ih ~ A x 


dx 


+ ih - A x — + 


,2 

dx ^ C 2 


K| 2 *) 8-14 


After collecting terms, and expressing our results in vector notation, we 
see that the Hamiltonian operator is 

H = (~h 2 V 2 + ih-V - A + 2ift-A • V + ^ Ia|A + op 8-16 

2 m\ c c c * 1 / 
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For an electromagnetic field such as that associated with a light wave, 

V • A = 0 and <p = 0. Since the perturbation of a system by a light 

2 

wave will be a small perturbation, the term |A| 2 may be neglected in 

c 

discussing radiation, although it must be retained when discussing per- 
turbations due to strong magnetic fields. For a system of charged 
particles, with an internal potential energy V, we will therefore have 
the Hamiltonian operator H = H 0 + H ; , where 

Ho = ~E + V; H' = E — ihAj ■ ^ 8-16 

j 2m, j j mjc 

The operator H° is just the operator for the system in the absence of an 
electromagnetic field; the perturbation H' may equally well be \yritten 

asH'= - E— Ay-R,. 

J rrijC 

8c. Induced Emission and Absorption of Radiation. Let us con- 
sider an atomic or molecular system subjected to the perturbation H' of 
an electromagnetic field. For simplicity, we first will assume that the 
field is that of a plane-polarized light wave with A y and A z equal to 
zero.* We shall need to calculate the values of such matrix elements as 

f <*H'< dr = (*r|H'|*°) = (<* I - E — A-xiPxi I *2) 8-17 

\ I j HljC I / 

Since molecular dimensions are of the order of 1/1000 the wavelengths of 
visible light, a sufficiently good approximation for our present purposes 
will be to take A as constant over the molecule. The matrix elements 

* That this situation represents a plane-polarized wave may be seen as follows: 
We take A = L4 x with 

A x = A\ cos 2 ? tv ^ t ^ 

This represents a wave moving in the z direction with a velocity equal to c. The 
associated electric and magnetic fields are then 

E = -- |a = i — Alein2^(t-- S ) 
c dt c \ e/ 

H = VxA = j^Ajsin2^^ -0 

The electric and magnetic fields therefore have equal amplitudes and are at right 
angles to each other as well as to the direction of propagation, and hence represent 
a plane-polarized light wave moving along the z axis with a velocity c. 
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can then be written as 

C j Tttj 0 If j Wij \ JviCyJ f 

e h t 1 / J d I A 

= -~-.A x e * E — ( — K ) 8 ‘ 18 

c i j ritj \ \dXj\ ) 

In order to obtain usable results, we wish to express the matrix elements 
of — - in terms of those of Xj. Let us for the moment consider that the 

dXj 

*’s are functions only of one coordinate x. Then and tn satisfy 
the equations 

2m 

-£r + -tf[E m - V(x)] t° m * = 0 8-18a 




8*18b 


If we multiply the first of these equations by x\pn and the second by 
and subtract, we obtain, upon integrating the resulting equation, 

(E - - *- 18 ' 


The first two terms may now be integrated by parts; since the wave 
functions vanish at infinity, we have 

#r ±,_. 0 

dx 


) 


- — (a ^°* ) 

J ~ KVm> dx 


f] 


O yy) f* 

= ^(E m - E n ) j * « dr 8-18d 
or 

II («* (£ - - S'. *-**' “• 818e 

of we integrate the first term by parts, we see that it is equal to the 
second. For this one-dimensional example, we have thus obtained 
the result 


|£b) = ~¥ {Em ~ 


8-18f 


This result can be generalized. The matrix element may 

therefore be written as 


«*|H'K) = -~A x l -(E m - E n )X n 
c n 


. E m —E n 
0 * 


8*18g 
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where X mn = is the matrix element for the x compo- 

i 

nent of the dipole moment. 

If we now assume that the system was originally in the state n , 
so that c n = 1 and all the other c’s are zero at the time t = 0, we have, 
for times sufficiently small so that all the c's are negligible except c n : 


dcm 

dt 



. E m - m En A 

E n )e « 


8-19 


If the light has the frequency v, the time dependence of A x may be 
expressed as 

A x = A x cos 2 t wt = \ A x (e 2vivt + e~ 2iriyt ) 


Then 


dc m 

dt 


1 [ j Em-En-t-hv . E m —E n ~hv \ 

^A° x X mn (E m -E n ) \e * +e * } 8-20 


Integrating, and choosing the constant of integration so that c m equal 
zero when t ~ 0, we have 


■A° x X mn (E„ 


.Bm—En—hv 


(E m — E n + hv ) 


Let us consider that E m > E n , so that the transition corresponds to 
absorption. c m will be large only when the denominator of the second 
term in brackets is nearly zero, that is, when E m — E n ^ hv, The 
probability that the system will be in the state m at time t will be the 
value of the product c*c m . The first term in brackets being neg- 
lected, this product is 




SH \A° x \ 2 \X n n\ 2 (E m - E n ) 2 - 


2 f Em Efi hv 

1 2 h 

Em - E n - hv } 2 
2 h I 


So far we have considered only a single frequency v . To obtain the 
correct value of c*c m we will have to integrate over a range of fre- 
quencies. Since c*c m is very small except for frequencies such that 
E m — E n ^ hv, it will be satisfactory to integrate from — » to + oo 
and regard A x as constant and equal to A x (v mn ). This integration gives 

£c m = ^ \A° x ( Vmn )\ 2 \X nn \ 2 t 8-23 


where E m — E n has been replaced by hv mn . 
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Equation 8-23 has been derived upon the assumption that the light 
was plane polarized. In the general case in which A y and A z are not 
zero, equation 8-23 will, of course, contain additional terms in A v and 
A t . In calculating c* t c m the cross terms will vanish because of the 
randomness of the phase differences between the various components 
of A; the final expression for c*c m will therefore be 

clcm = {\Al{v mn )\ 2 \X mn \ 2 + \A° v (, mn )\ 2 \Y mn \ 2 

+ U“(, m „)i 2 |Z m „| 2 }< 8-24 

If the radiation is isotropic, then 

\A° x (v mn )\ 2 = 1 A° v { Vmn )\ 2 = \A% mn )\ 2 = i|ri°(,w)| 2 8-25 

We may express \A°(v mn )\ 2 in terms of the radiation density p(V m „). 
From Equation 8-8, 

E (v mn ) = ^ A°(v mn ) sin 2irvt 

c 

Then 

O 2 2 

E 2 (v mn ) = ^ | A \ Vnn )\ 2 
c 


since the average value of sin 2 2nvt is In electromagnetic theory, it 
1 


is shown that p(v mn ) = — E 2 { v mn ), so that \A x (v mn )\ 2 

47 T 


2 P ( v mn ) • 


The product c^c m may now be written in terms of the radiation den- 
sity as 

2tt 


c* m c m = ^- 2 { \x m 


+ Ymn 2 + | Z mn 2 }p(v mn )t 


8 -: 


Since the probability that the system will be in the state m is zero at 
time t = 0, and is the value given by 8*26 at time t, the probability 
that a transition from the state n to the state m will take place in 
unit time, resulting in absorption of energy from the electromagnetic 
held, is 

Bn—^mP (ymn) = ^2 I^mnj 2 p0'mn) 8*27 

where 

\R mn \ 2 = \X mn \ 2 + | Fm„| 2 + |Z m „| 2 8-28 

If the system is originally in the state m, then the same treatment 
shows that the probability of transition to the state n, resulting in the 
emission of energy, due to the perturbing effect of the electromagnetic 



114 TIME-DEPENDENT PERTURBATIONS: RADIATION THEORY 


field, is 


Em — >nP (ymn ) E n — >mP ( v mn) 


8*29 


8d. The Einstein Transition Probabilities. The coefficients J9 w _* n 
and B n _+ m are known as the Einstein transition probability coefficients 
for induced emission and absorption, respectively. Since a system in 
an excited state can emit radiation even in the absence of an electro- 
magnetic field, the completion of the theory of radiation requires the 
calculation of the transition probability coefficient A m -+ n for spon- 
taneous emission. The direct quantum-mechanical calculation of 
this quantity is a problem of great difficulty, but its value has been 
determined by Einstein 1 by a consideration of the equilibrium between 
two states of different energy. If the number of systems in the state 
with energy E m is N m , and the number in the state with energy E n is 
N n , then the Boltzmann distribution law states that at equilibrium 


Nrn 

N n 




8-30 


where T is the absolute temperature and k is the Boltzmann constant. 
Expressed in terms of the transition probability coefficients discussed 
above, the number of systems making the transition from m to n in 
unit time is 

>n ~{~ E m — +nP(?mn) } 


Similarly, the number of systems making the reverse transition is 

N n E n ^ m p{y mn ) 

Since at equilibrium these two numbers are equal, we have, after elim- 
TSf m 

mating the ratio — by means of equation 8*30, 
iV n 


e 


hvmn 

hr — 


En — >7 nP (ymn ) 

Am — + B m—*n p(?mn) 


or 

Using relation 8*29, 


P ( ymn ) 


__ 

A p hr 

hvmn 


8*31 


— B m _+ n e kT + B n _+ m 

the energy density may therefore be written as 


P\?mn) - " g 

e kT — l 

1 A. Einstein, Phydk . Z. t 18, 121 (1917). 


8*32 
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The energy density in a radiation field in equilibrium with a black body 
at a temperature T may be calculated by the methods of quantum 
statistical mechanics (Chapter XV) and, as mentioned in Chapter I, 
is given by Planck’s radiation distribution law as 

P(» w) = 1 ^— 8-33 


h v mn 

e kT - 1 


Comparing 8-32 and 8-33, we see that 

Sirhv^ 


B n 


or 


32t r 3 ^ r 
3 c s h 


| Rn 


8-34 


To the degree of approximation used above, the coefficient for spon- 
taneous emission depends only on the matrix element for the electric 
dipole moment between the two states. If the variation of the field 
over the molecule is not neglected, there will be additional terms in the 
expression for ri w _» n , the first two additional terms being those corre- 
sponding to magnetic dipole and electric quadripole radiation. In- 
cluding these terms gives 2 

( I e I \| 2 

+ T^r’ 1 ' 3 ~pr I (wi|err|n)| 2 j 8-35 

We may estimate the relative orders of magnitude of these terms as 
follows. Disregarding the constant term, we have 

| (m|er|n)| 2 ~ ( ea 0 ) 2 ~ 6-5 X 10 -36 c.g.s. 

— — rxpjn')) ~ (z~) ~ 8-7 X 10~ 41 c.g.s. 

2 me | )\ \2mc ) 

2 

■^ir 3 ^p |(m|err|n)| 2 ~ 6-8 X 10 -43 c.g.s. (X = 5000 A) 
c 

We thus see that the probability of transition due to magnetic dipole 
or electric quadripole radiation will be negligible in comparison to the 
probability of transition due to electric dipole radiation. The higher 
terms in 8*35 will therefore be of importance only in those cases in which 

2 E. U. Condon and G. H. Shortley, The Theory of Atomic Spectra , p. 96, Cam- 
bridge University Press, 1935. 



327T 3 . 3 , 

3c 3 h 


|(m|er|rc)| 2 + 
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the electric dipole matrix element ( m\ei\n ) vanishes because of the 
symmetry properties of the states m and n. 

The actual intensity of radiation of frequency v mn due to sponta- 
neous emission will, of course, be 

liymn) — N m (JlV mn )A m—*n 8-36 

8e. Selection Rules for the Hydrogen Atom. According to the 
results derived above, the only transitions of importance in the hydro- 
gen atom will be between those states a and b for which 

(ajerjb) = e{i(aja;|&) + ){a\y\b) + k(«|z|6)} 

is different from zero. The eigenfunctions for the hydrogen atom may 
be written as 

in,i, m =Kr)P\ m Kco S e)e im * 

so that we must investigate the values of such integrals as 

/Vn , l , mZ'I't', I\m' dr = (* ypn, l, in (r cos 0)^*> t v # dr 


This integral may be written as the product of integrals in r, 0, and <p. 
The integrals in r will be non-vanishing, so that we may concentrate 
our attention on the integrals over the angular coordinates. For the 
z component of the electric dipole moment we must therefore investigate 
the integral 


J Pj ml cos e p]!"' 1 sin 9 de J e i(m - m ' ) * cbp 


8-37 


From the recursion formulas for the associated Legendre polynomials 
(equation 4-85), we have 


cos 


OPl?' 1 = 


V - \m'\ -(- 1 

2 1' + 1 


p|m'i 


+1 


+ 


l' + 


m 


21 ' + 1 


m 


8-38 


The integral 8*37 is therefore non-vanishing only if 

m = m'; l — l' ± 1 8-39 

These relations are the selection rules for the emission of light polarized 
in the z direction. Rather than calculate the matrix components for 
x and y separately it is more convenient to calculate those for the com- 
binations (x + iy) = r sin 6 e tv and (x — iy) = r sin 6 e~" p . For the 
combination (x — iy) we have the integrals 


J P] m| sin 9 Pjr'l sin 9 de J d<p 8-40 
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The integral over <p is non-vanishing only if m = m + 1. From the 
recursion formula 4-86 

sin 0 ft*' 1 = — i— - PfcV 1 } 8-41 

Jdl "T" 1 

we see that the integral over 6 is non- vanishing only if l = V ± 1. 
Similarly for the combination (x + iy) we have the selection rules 
m = m! — 1 , l = V ± 1 . The selection rules for the hydrogen atom 
may therefore be written as 

Al — dbl; Am = 0, ±1 842 

The selection rules for the different types of polarization are, of course, 
significant only when there is a unique z direction, due, for example, 
to the presence of a uniform magnetic field. This subject will be dis- 
cussed more fully in the following chapter, where the Zeeman effect is 
considered. It is apparent from the derivation of the above selection 
rules that they are not limited to the hydrogen atom but are valid 
for any central field problem where the angular portion of the wave 
function is identical with that of the hydrogen atom. 

8f. Selection Rules for the Harmonic Oscillator. Let us now con- 
sider the system consisting of a particle moving along the x axis with a 
harmonic motion. Let the charge on the particle be +e, and let the 
charge at the position of equilibrium be —e. The instantaneous value 
of the electric dipole moment will then be ex. The wave functions for 
the system will be the eigenfunctions of the harmonic oscillator (sec- 
tion 5e). A transition between the states n and m will be possible 
only if the integral 

(n\ex\m) = e J'tnX'I'm dx 843 

is different from zero. From the recursion formula (equation 4-99) 

= mH m — i -f- 844 

we see immediately that the integral 843 will be zero unless m = n ± 1. 
This is the selection rule for the harmonic oscillator; only the funda- 
mental frequency v can be emitted or absorbed by this system. We 
shall later require the exact values of the integrals in 843. The wave 
functions are 

I? 

M*) = N n e 2 H n (Z) 845 

where 
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so that 

(n\ex\n + 1) = N n N n+1 -^= J e~ (i H n (Z)%H n+1 (£) dx 8-46 
By the use of the recursion formula 8-44, this reduces to 
(n\ex\n + 1) = N n N n+l ~ JV fS // n (£)(n + 1 )#„(£) dx 


- 4 = (n + 1) f [*.(*)]* dx = -~= (n + 1 ) 8-47 

Va N n J Va Nn 

Introducing the explicit expressions for the normalizing factors, we 
obtain 

{n\ex\n + 1) = 8-48 

Similarly, 

(n\ex\n - 1) = N n N n _ t ~ J e~ p H n (.min-^) dx 


e Nn-i 
2V« N„ 



8-49 


8 g. Polarizability; Rayleigh and Raman Scattering. If a is the 
polarizability of an atomic system, then an electric field E induces a 
dipole moment R = aE in the system. Let us consider classically a 
system in which an electron of charge —e is bound elastically to an 
equilibrium position at which there is a charge +e. If the system is 
subjected to an alternating electric field of strength E = E 0 cos a >t, 
the classical equation of motion is 


d 2 r 

m — + /cr = — dS 0 cos cot 


8-50 


where r is the displacement of the electron from the origin and k is the 
force constant of the forces binding the electron to the equilibrium 
position. The steady-state solution of this equation is 


r = 


— eE 0 cos cot 
k — mco 2 


— -e* E 0 COS cot 
m 

2 2 
COq — CO 


fk 

where ug — yl~ . , The dipole moment of the system is R 


8-51 


-er; 
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the classical polarizability of the system is therefore 


e 2 /m 


If we 


now write r = A cos c ot, then the acceleration of the electron is 
d 2 r 

— == —Aw 2 cos cot. The time average of the square of the accelerar 
at 

tion is 


= Ujv cos 2 ut = iUiv 


8-52 


According to classical electromagnetic theory, an electron moving with 

2 c 2 a 2 

acceleration a radiates in one second the amount of energy — 3 • 

oC 

Classically, the system described above would therefore radiate light 

of frequency v = the amount of energy radiated per Second being 

e 2 ]A]V 

3c 3 

According to quantum mechanics, if a system is in a state a, then 
the probability of a transition to a state b with the emission of light 
of frequency v a b is, as we have seen earlier in this chapter, 

8-53 

The amount of energy radiated per second is 

= ^jr 1 Rail 2 8-54 

Let us now calculate the dipole moment associated with the transi- 
tion between the states a and b, which we define as 

R»6 = + (*rjer|«2) 

f .Bab. . Bgb t 'l 

= (a|er|b)\e* * + e * ) 

= 2(a]er|fr) cos — p t = 2(a|er|6) cos u a j,t 8-55 

71 


where o>db 


Egb 

h 


E a — Eb 


By analogy with the classical argu- 


ment, we therefore calculate the rate of emission of energy of frequency 
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v a b to be 



4c4 

3c 3 


|(a|er|6)| 5 


64t 4 4 

3c 3 


\Rab \ 2 


8-56 


which is identical with the quantum-mechanical result in equation 
8-54. This is a justification for the definition given in 8*55 for the 
dipole moment associated with the transition between two states a 
and b.. R a & may also be written as R 0 i> = 2Re{ (’Pa*|er|4'°)!, where 
Re means that we are to take the real part of the quantity in brackets. 
For the dipole moment associated with a given state 

R<j = 

A system which in the absence of an electromagnetic field is in the 
state represented by ^ has, in the presence of the field, the perturbed 
wave function 

¥« = *2 + Zcb^l 8-57 

b 

where c&, according to equation 8*21, is given by the relation 

e~*~ 

— — (- — 1 4- constant 8-58 

E ba + e Eba- e] 

where Eba = E b — E a ; e = hv. The dipole moment associated with 
the state SPo is 

f .Eab\ 

R 0 = Re{ ('J'alerl'S'o} = (o|R|o) + 2f^4I> ^ ,(a|R|^>)e , * / 

b 

= (o|R|a)+ifc{i:^(o|R|6)(6|R|o)-A8 

where R = er. In writing this last expression the constant in equa- 
tion 8*58 has been placed equal to zero. This can be done in this case 
without loss of generality, as the inclusion of this constant would give 
us no additional terms which would be functions of the perturbing 
fields. Equation 8*59 may be simplified to 

(a|R|&)(6|R|a)-A8sinjf 8-60 
ft 

Since Eo = — - -37 = -? A? sin 7 1, the dipole moment as a function 
c dt eft ft 




-i-t 

e n 


[Eba + € Eba — M 


8-59 


Cb = A ° ’ 
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of the electric field becomes 

R* = (o|R|a) + £ — Lr 1 TrV) -Eo 

b * \&ba “ € hj}> a + €/ 

= (o|R|o) + (o|R|6)(6|R|o) • E 0 

6 &ba ~ « 

= (a|R|o) + |E IT^ (o|R|6)(6|R|o) - Eo 8-61 

h b Ha — v 


The first term in this expression is the permanent dipole moment \x 
of the system. In order to determine the polarizability, we must 
write the remaining terms in the form dE 0 , which can be done by 
averaging the vector quantities over all orientations of the system with 
respect to the field, assuming all orientations to be equally probable. 
The quantity which we must average is of the form RR • E. If 6 is the 
angle between R and E, this is equal to R|iSj \E\ cos 0, which is a vector 
in the direction of R. The component of this vector along E is 
| R | \R\ 1 2? | cos 2 0. Averaging over all values of 0 , we obtain for the 
magnitude of the vector along E the value \R\ \E\, so that the 
average value of the vector quantity RR • E is -J-R • RE. Making this 
transformation in 8*61, we obtain 


*•- (a|E|o) +S5 


»y(4jW 


2 

Ha 


En 


8*62 


The second term is now of the form aEo, so that we have obtained the 
quantum-mechanical expression for the polarizability 


2 Via Rabl^ 

“ 3 hr via- V 2 


8-63 


The dipole moment R a contains a term which varies with the frequency 
p of the incident light. According to the above discussion, the system 
will therefore radiate light of frequency v, which will be in phase with 
the incident light and which is therefore called coherent. This co- 
herent scattering of light by an atomic system is known as Rayleigh 
scattering. 

Of more importance for the elucidation of problems in molecular 

K cture is the study of the phenomenon known as incoherent or 
gan scattering. In order to understand the origin of this phenom- 
£ let us calculate the dipole moment for the transition a b when 
System is subjected to radiation of frequency v. The wave func- 
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tions are 


= *a + Zcjfc*2 
ft 

I 


The dipole moment for the transition is therefore 
Rot = 2iJe{(**[R|*i)} 

• Egb . . Ekb . . Eal 

= 2i?e{(a|R|6)e * + Xx£(fcjR|6)e * + ]Cc 2 (a|R|Z)e * 

< k l 

where the coefficients c% and c* are 


°* = ~2ch~ (/c|R|a) ' A ° 


e | c 

E ka ~f” € i^ka € 




e n e n L 

^ Elf, — € 


Substituting these expressions for the coefficients in equation 8*65, we 
obtain for the dipole moment of the transition between the two states 
a and b the result 


.Egb f jT? 

Rot = 2 Re (a|R|6)e’ * - £ (fc|R|6)(A:|R|a) • k° 0 X 

ft 2 iCll 


+ £^1 («|R[Z)(i|R|6) - Ag X 


E ka 6 Eka ~ * J l 


+ C Elb — € 


Re i ie 


- sm (rir-; 


= sin ^ “ s * n ^ ^ ~~ ^ e ^ c - ^ 

the expression for the dipole moment R a & may be simplified to 

Ra b = 2(a|R|6) cos W - E “ (fclR|&) (fc|Rja) • Ag X 

& eft 

- sin2 ;^r ) ‘ ) -ft <»w‘><w» • ** 

(sin 2ir(r + v ab )t sin 2t(v - r 0 t)<l 


Eib + 6 


Eib — e 
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Since the summations over k and l are entirely equivalent we may re- 
place both of them by a summation over j and write 8*70 as 


El* = 2(o|R|6) cos 2irv a bt + £ ) X 

i \Vaj + v v jb + vj 

0'|R|6) • sin 2ir(v + v a b)t + £ (—~r X 

cn j \v b j + V Vj a + V/ 

(a|R|j) 0‘|R|6) • ~ sin 2x(v - v ab )t 8-71 
cn 


vhere v a j 



The dipole moment for the transition thus con- 


tains three terms varying with the time. If the matrix element (a|R|6) 
s not zero, light of frequency v a b will be emitted. This, of course, is 
ust the term corresponding to ordinary emission, as stated earlier in 
this chapter. We see, however, that if there exists some state j for 
vhich (a|R|i) and 0jR|^) are simultaneously different from zero, then 
ight of frequency v + v ab and v — v a b are also emitted by the system. 
The first frequency arises when the system goes from an excited state 
bo a lower state, thus adding energy to the field; the second arises 
vhen a system absorbs energy from the field. 

For a harmonic oscillator, the selection rule for the Raman effect 
& the following. Let us assume that the harmonic oscillator is origi- 
lally in the state a with quantum number n. Then the matrix ele- 
ment (a|R|j) will be different from zero only if the state j has the 
quantum number n±l. Similarly, if state b has the quantum num- 
ber m, (/(Rift) will be different from zero only if state j has the quantum 
lumber m =t 1. Both matrix elements will be simultaneously different 
:rom zero only if m *= n or m = n =fc 2, so that we may conclude that 
the selection rule for Raman scattering by a harmonic oscillator is 
\n = 0, ± 2. The first possibility corresponds to scattering of light 
}f the incident frequency v; the second corresponds to scattering of 
light of frequency v ± 2v e , where v e is the fundamental frequency of 
the harmonic oscillator. 

If we assume that v v a j we may write 


X — — (q|R|j)0'|R|O = — X(o|r|j)C/|R|W 

3 V V i 

= — (a|RR|6) 8-72 

V 

According to 8*72, the frequency v + v a b will appear in the Raman 

S t if any matrix element of the type* where X\> equal 

or z, is different from zero. We shall use this formulation of the 
ttion rules for Raman scattering. 
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The spectrum of a many-electron atom will in general consist of 
hundreds of lines with little apparent regularity. Before the develop- 
ment of the quantum theory it was found empirically that when the 
observed wavelengths were expressed as frequencies, a set of numbers 
could be assigned to a given atom such that all observed frequencies 
would be given by the difference of two numbers in this set, although 
not all differences would appear in the observed spectrum. In modern 
terms these experimental facts are expressed as follows. For a given 
atomic system, there exists a set of discrete energy levels, or “ stationary 
states,”. of energy Ei, E 2 * • • E n , • • •. Transitions between certain of 
these states are allowed, these transitions resulting in the emission of 


radiation of frequency v = 


E m •— E n . 


h 


if the transition is from the state 


designated by the letter m to that designated by the letter n. In this 
chapter we shall determine the number and characteristics of the states 
present in a given atom, derive the selection rules regulating the transi- 
tions between these states, and calculate theoretically, so far as these 
calculations are feasible, the eigenvalues and eigenfunctions of these 
states. This program will necessitate the inclusion of several additional 
features in our general theory, the first of these being the hypothesis of 
“ electron spin.” 

9a. The Hypothesis of Electron Spin. If the energy level diagram 
of an alkali metal, which has one valence electron, is drawn from the 
spectroscopic data, it is found that most of the levels appear in closely 
spaced groups of two. In the alkali-earth metals, with two valence 
electrons, the levels may be divided into two series; in one series the 
levels appear separately, in the other in groups of three. This multi- 
plicity of levels does not appear if the electron is assumed to have no 
properties other than mass and charge, as was dope in our previous 
treatment of the hydrogen and helium atoms. This difficulty 
resolved by Goudsmit and Uhlenbeck, 1 who introduced into the old 
quantum theory the hypothesis that the electron possesses an intruidjC 


1 G. Goudsmit and S. Uhlenbeck, Naturwissenschaften , 13, 953 (1925). 

124 



THE HYPOTHESIS OF ELECTRON SPIN 


125 


angular momentum — , with which there is associated a magnetic 
2t 

ch 

moment of magnitude . The interaction of the magnetic moment 

47 t me 

of the electron with the magnetic fields produced by the motion of the 
electrons in their orbits was then found to give rise to the splitting of the 
energy levels into the observed multiplets. 

The spin of the electron was measured experimentally by Stem and 
Gerlach. 2 A beam of silver atoms was passed through a strong inho- 
mogeneous magnetic field. It was found that the beam divided sharply 
into two beams. One beam was deflected as though each atom were a 

eh 

magnetic dipole of magnitude oriented along the field direction: 

47rrac 

the other was deflected like a magnetic dipole of the same strength 
but oriented in the opposite direction. There are good grounds for 
believing that the total angular momentum due to the electrons mov- 
ing in their orbits vanishes for the normal silver atom, and that the 
observed magnetic moment is due solely to the single valence electron. 
The Stern-Gerlach experiment is therefore a direct measurement of the 
electron’s intrinsic magnetic moment. 

It is impossible to explain by classical mechanics why the beam of 
silver atoms is split into two distinct beams; classical mechanics would 
predict only a broadening of the beam. If, however, we assume that 
the spin angular momentum obeys the quantum-mechanical laws of 
angular momentum, and assume that l = \ ) then we would expect that 
the beam would be split into two beams corresponding to the states with 
m = and m = — 

In view of this fact we make the assumption that the electron has spin 
angular momentum, represented by a set of operators S*, S y , S*, and S 2 
which are analogous to the operators M*, M y , M*, and M 2 for orbital 
angular momentum and obey the same commutation rules. According 
to the discussion in the previous paragraph, we assume that there is 


h 2 

only one eigenvalue J(1 + 2 ) r ~2 f° r 80 that there are only two 

471 - 


eigenvalues of S*, namely, 


hi~ and 



In referring to a general 


spin eigenfunction we shall use the quantum number s, writing s — for 

2tt 

the eigenvalues of S*. s may therefore have the values 
I^et us denote by a the spin eigenfunction corresponding to 8 * +J, 

%0. Stem and W. Gerlach, Z. Physik , 8, 110; 9, 349 (1922). 
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and by 0 the spin eigenfunction corresponding to s 
eigenfunctions then satisfy the equations 


02 3 h 2 

Sa = U? a 


S t a = - — a 

* 2 2tt 




S,0 = - i— 0 
2 2t 


These 


9-1 


9-2 


In order to find the effect of S* and S„ on these eigenfunctions we use the 
equations analogous to 3-132 and 3-133. These are, making the proper 
substitutions, 

h 

(S* — iS v )ot - — 0 

(S* — i'S„)0 = 0 
(S, + »S„)« = 0 

(S® + *S„)0 - £« 


From these equations we find by 

o h 

s *“” s'* 

« if 1 ~ 


addition and subtraction that 


Sx0 = 

h 

7~ a 

4i r 

* 

9-4 

s *0 = 

ih 

-~4w a 


9-5 


The effect of such operators as S X S V , etc., can be easily found by the 
successive application of the above rules. For example, 


S X S ya = 


ih 2 

16tt 2 


a; 


S y S x a = — 


ih 2 

I^ a 


9-6 


so that 


(S#Sy 


ih h 

a 

2tt 4ir 



9-7 

h 


as, of course, it must be if the commutation rules are to be satisfied 
This formal method of treating electron spin was developed by Paulk 1 
In order to fix in our minds more clearly the relation between angu|| 
momentum and magnetic moment, let us consider a particle of mass§| 
and algebraic charge e moving in a circular orbit of radius a wit|pl 
velocity v. The motion of the charged particle is equivalent to a curpn^ 
flowing along the orbit, the magnitude of the current being equal tc^k 

*W. Pauli, Z. Phyrik, 48, 601 (1926). 
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charge divided by the time required for a complete cycle, or 

„ e 


2ira/v 


ev 

2ira 


According to electromagnetic theory, a current I moving about a loop 

I A 

of area A is equivalent to a magnetic moment — . The magnetic 


moment associated with the motion of the charged particle is therefore 

r,2 


ev ir ar e e 

M = = — av = - — p 

2c 2 me 


9*8 


2 x 0 c 

where p is the angular momentum of the particle. The ratio of mag- 

netic moment to angular momentum, - — , is called the gyromagnetic 

2mc 

ratio. The classical result above gives the correct relation between 
orbital angular momentum and associated magnetic moment in quantum 
mechanics as well. For spin angular momentum, however, we obtain 
correct results only if we take the gyromagnetic ratio to be twice as 
large. The magnetic moment of the spinning electron is therefore 
represented by the operators 

e 0 

Hx = — s* 


\*>V = 


me 

= — S« 


9*9 


me 

— S K 
me 


where e and m are the charge and mass of the electron. In a magnetic 
field with the components H xy H V) and H g , the classical energy of a 
magnetic dipole is 

~ (H x p x + Hy\i y + H z p z ) 

The corresponding operator is obtained by replacing the p’s by the 
operators in 9*9, so that in a magnetic field the Hamiltonian operator 
Contains the operator 

- — (j H X S X + H y S u + H, S.) 9-10 



e z axis is taken to be the field direction, so that H x and H v vanish, 
educes to 
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The spin eigenfunctions a and /S are therefore the correct eigenfunctions 
of the Hamiltonian operator as well as of the operators S* and S 2 . If 
the field strength is allowed to become zero this argument still holds, 
so that even if there is no magnetic field we still use a and /3 as our spin 
eigenfunctions of the Hamiltonian operator. 

9b. The Electronic States of Complex Atoms. For an atom of 
atomic number Z, the approximate Hamiltonian is 


H = - 


h 2 


8ir 2 m i 


Evf 


Ze 2 e 2 

Z — + Z- 

» U i>j Tij 


912 


where V? is the Laplacian operator for the rth electron, 77 is the distance 
of the zth electron from the nucleus (motion of the nucleus can be 
neglected in heavy atoms), and 777 is the distance between the ith andjth 
electrons. The exact Hamiltonian contains additional terms represent- 
ing the magnetic interactions of the electronic orbits and spins, which 
we will disregard for the present. 

e 2 

Because of the presence of the terms — in the Hamiltonian operator 

Tij 

it is not possible to separate the variables in the wave equation. The 
problem must therefore be treated by approximate methods, but it is 
gratifying to find that all the qualitative features of atomic structure can 
be obtained with little actual computation. As the starting point of 
the perturbation theory we use the set of one-electron eigenfunctions 
which are the solutions of the wave equation with the Hamiltonian 

H 0 - -ilvf + EFW 9-13 

o7T 77% f i 


where 7(77) is an effective potential for the zth electron. The differ- 
ence between 9-12 and 9*13 is treated as the perturbation 

Ze 2 e 2 

H' = -£— + L- - EF(t7) 944 

i ?7 i>j r ij i 


In order to obtain a good set of zero-order functions, V (r t ) should be 
taken to be such a function of 77 that H ' is as small as possible. If 
is much greater than any of the other r/ s, the field due to the remainiH| ; 
electrons will approximate that of a charge — (Z — l)e at the origin, 

2 !W 

er 

that V (77) , in which case we say that the nucleus h^s been 

77 


screened by the other electrons. If 77 is much smaller than any 
other 7*/s, the field due to the other electrons is approximately th 
inside of a spherical shell of charge, which is a constant 
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V{Ti) ■= 


Ze _ 2 

r. 


+ constant. 


As the general features of the electronic 


states are not dependent upon the explicit form chosen for V(r ), a 
further discussion of the radial part of the wave equation will be reserved 
for later sections of this chapter. 

The eigenfunctions of the approximate Hamiltonian 9*13 are known 
as atomic orbitals. Since we have chosen a potential function which 
is a function of r only, they will be similar in many ways to the hydrogen 
eigenfunctions. The factors of the orbital depending on 0 and <p are 
of the form P\ m 1 (cos 6)e im<p , and the radial factor is similar to the radial 
function of hydrogen. Each orbital may therefore be labeled by quan- 
tum numbers n, l, m, analogous to those in hydrogen, and we may speak, 
of 2s orbitals, 3 p orbitals, etc. One important difference between these 
atomic orbitals and the hydrogen eigenfunctions is that the energy of a 
hydrogen eigenfunction is a function of n only, whereas the energy of an 
atomic orbital depends on l as well, so that, for example, a 2s and 2 p 
orbital have different energies. Since the energy of an atomic orbital 
is not a function of m, the value of this quantum number is not specified 
when we describe an orbital. 

9c. The Pauli Exclusion Principle. Having once chosen an appropri- 
ate set of atomic orbitals we must combine these orbitals into approxi- 
mate eigenfunctions for the entire atom. This can, of course, be done 
in a variety of ways, depending on the choice of orbitals from the chosen 
set. Let us suppose, however, that we have decided to construct an 
eigenfunction corresponding to a state in which the numbers of Is, 2s, 2 p, 
etc., electrons are all fixed. To be concrete, suppose that the state is 
one with two Is electrons and one 2p electron. Let us denote our orbital 
by <p(n, l, m\i), giving the first part of the parentheses the three quantum 
numbers, and in the other part the number of the electron occupying 
the orbital. Then we might take the product 


*(100|l)*(100|2V(21l|3) 9-15 


as an approximation to the eigenfunction for the desired state. We 
might equally well have chosen 

?(100|3V(100|2V(2lI|l) 9-16 

{<p( 2ll|l) = <p( 2, 1, — 1 1 1 ) } or any similar product. Up to the present 
we have not considered the spin of the electrons. Since the Hamiltonian 
9-12 does not involve the spin in any -way as yet, we may take the 
functions a and 0 of section 9a as representing the two possible spin 
states of each electron. If we insert the s value after the three quantum 
numbers n % l , m in our symbols for the orbitals the eigenfunctions for our 
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chosen state might take on the appearance 

*>(100§|l)*(100j|2)*>(100j|3) 9-17 

In this way we can write an eigenfunction, in fact a number of eigen- 
functions, for any state. But we could by this procedure write eigen- 
functions corresponding to states which are never observed. For 
example 

*(100*|l)*(100*|2)*(100*|3) 9-18 

would represent a state in which the atom had three Is electrons, all 
with positive spin. If we were to calculate the energy of this state in 
lithium by the variation method we would find a lower energy than that 
of any known level, and yet the theory of the variation method tells us 
that our calculated energy must be too high. 

The way out of this difficulty was discovered by Pauli. 4 Before 
quantum mechanics took on its present form Pauli put forth the follow- 
ing exclusion principle: no two electrons can have all their quantum 
numbers, including spin quantum number, the same. With the new 
quantum mechanics this has been changed to the following statement, 
from which the previous one may be derived: every eigenfunction must 
be antisymmetrical in every pair of electrons (or more generally in every 
pair of identical elementary particles); that is, if any two electrons are 
interchanged, the eigenfunction must remain unchanged except for the 
factor — 1. The electron density distribution for a given state is given 
by the square of the corresponding eigenfunction. If two electrons are 
interchanged, the electron density distribution must, of course, remain 
unchanged. This means that the eigenfunction must either be unaf- 
fected by this interchange of electrons or must change only in sign. It is 
found empirically that we obtain a correct description of nature only if 
we always adopt the second alternative. 

It is not immediately apparent that the new form of the exclusion 
principle includes the old. Let us, however, consider the determinant 

*(100*10 *(100*|2) *(100*|3) 

Z) = -JL ^(lOOflO *(100j|2) *(100*|3) 9-19 

*(211*|1) *(211*|2) *(211*|3) 

If we interchange any two electrons in this eigenfunction, say electrons 1 

and 2, then two columns of the determinant are interchanged. This 
causes a change of sign of the determinant, so that this eigenfunction 
satisfies the exclusion principle in the new form. Any antisymmetric 
eigenfunction can be written as such a determinant or as a linear combi- 
4 W. Pauli, Z. Phytik, 81, 765 (1925). 
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nation of such determinants. Now if we attempt to set up an eigen- 
function in which two electrons have the same set of quantum numbers, 
and write the eigenfunction in its determinantal form, it is seen that two 
rows of the determinant are identical. But such a determinant, one 
with two identical rows, vanishes, and the eigenfunction vanishes. 
Therefore the old form of the exclusion principle is included in the new. 

Accepting the exclusion principle in its new form as a new postulate 
to our theory, we are no longer permitted to use the simple products 
such as 9*15 in our future discussion, but only linear combinations of 
determinants such as 9*19. In other words, though we can construct 
eigenfunctions corresponding to states in which a definite set of orbitals 
is occupied, we cannot specify which electron occupies a particular 
orbital. 

In most atoms the energies of the various orbitals lie in the order 
1.5, 2s, 2 p, 3s, 3 p, 4s, 3d, 4p, 5s, 4d, 5p, 6s, 4 /, 5d, 6p, 7s, 6d, the Is orbitals 
having the lowest energy. When the atom is in its normal state, its 
Z electrons occupy the lowest orbitals possible without violating the 
exclusion principle. Thus hydrogen, with one electron, normally has 
this electron in the Is orbital, but in lithium one electron must go into 
the 2s orbital, since there is no way in which three electrons can occupy 
the Is orbital without violating the Pauli exclusion principle. The 
group of two Is electrons is present in all the elements of higher atomic 
number than 1. Such a complete group is known as a closed shell. 
With beryllium, the 2s shell is completed, and the fifth electron in boron 
must occupy a 2 p orbital. Since there are three different 2 p orbitals, 
corresponding to the values 1,0, — 1, for the quantum number m, and 
since each of these may have either of two values for the spin quantum 
number associated with it, the 2 p shell can hold six electrons. In the 
elements from boron to neon the 2 p orbitals are being filled. When we 
come to sodium, however, there is no more room for 2 p electrons, and so 
the eleventh electron goes into a 3s orbital. Magnesium completes the 
3s shell, and from aluminum to argon the 3 p orbitals are filled. In this 
way we build up all the atoms in the periodic table. There are a few 
irregularities which will be discussed in section 9m, where we shall con- 
sider the properties of the atoms in the periodic table from the view- 
point of their electronic structure. 

A state of an atom in which the number of electrons in each shell is 
specified is called a configuration. The usual notation for a configura- 
tion is the following. We write the designations Is, 2 p, etc., for the 
various shells, with exponents indicating the number of electrons in the 
shell. Thus the normal configuration of lithium is (Is) 2 (2s), and the 
normal configuration of phosphorus is (ls) 2 (2s) 2 (2p) 6 (3s) 2 (3p) 8 . 
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9d. The Calculation of Energy Levels. 6 For a closed shell of 
electrons only one eigenfunction may be written. Thus for a shell of 
six 2 p electrons we may write 


1 

V6! 


<p( 21lf|l) 
*(211$|1) 
*(210j|l) 
?(210j|l) 
v( 21TJ11) 

^(21TJ|1) 


No other determinant is possible. 


*>(21l!|6) 


( 


<p(211ijrj6) 

1 


The factor 


V6! 


is a 


9-20 


normalizing 


factor; the determinant as written will be normalized if the functions <p 
are normalized. For N electrons the factor is — . ) Hence for a 

Vn\ ) 

closed shell the eigenfunction is completely determined by the configura- 
tion. For an incomplete shell, however, there are always a number of 
possible eigenfunctions. For example, the configuration (l$)(2p) has 
twelve possibilities. Abbreviating the determinants by writing just 
the principal diagonal between bars, these are 


= l^ (100 ^l 1) ^ (211 JI 2) l 

D* = ^ |*(100!|lM210i|2)| 

^4 = ^ |* (10 °I1 1 M 210 JI 2) I 

D 6 = ^ I^IOOJI l>(2lli]2) | 
Z) 6 = -^ |„(100§|lM2lI||2)| 


Dl = ^l*(iooJ|i)*(2iiJ|2)| 
D * = ^k i °pii i v( 2ii }i2)i 
Dg = ^|«’( 100 J| 1 V( 21 0||2)| 

Dl ° = ^21 ^ ( 100 il 1 ) «’ ( 21 °JI 2) l 

D n = ^^(100j|lV(2lI^|2)| 
D 12 = |^(100}|1V(21IJ|2)| 


6 The methods used in this and several Subsequent sections are essentially due to 
J. C. Slater, Phys . Rev., 34, 1293 (1929). 
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Let us suppose that in the general case there are n determinants 
Du D 2 , • • • D n corresponding to a given configuration. The first-order 
perturbation theory for degenerate systems then tells us that the energy 
levels will be given by the roots of the equation 


H „ 

- S n E 

H 12 — S12E * • • 

••• H ln 

- S ln E 


H 21 

- S n E 

i /22 — S22 E * * * 

••• H 2n 

- S 2n E 

= 0 9-21 

Hn 1 

— S n iE 

H n 2 S n 2 E * * * 

* * * Hnn 

S nn 2? 



where Hij = 


J D*HDj dr; S {j = J D* { Dj dr. 


Equation 9*21 is analo- 


gous to 7*50, except that the D functions have not been assumed to 
be orthogonal. 

9e. Angular Momenta, The large number of states arising from 
most configurations makes the direct solution of equation 9*21 a very 
difficult task. Equation 9*21 can be simplified by the use of the opera- 
tors representing the various angular momenta of the atom. Represent- 
ing the operators for the components of orbital angular momentum of the 
various electrons by Mai, M v i f M z i; M X 2 , M^, M Z 2 , etc., we define the 
operators for the components of the total angular momentum by 


M x = M x i + M32 + * • • 

M y = M yl + M y2 + • • • 9-22 

M 2 = M 2 i + M Z 2 + • • • 

The operator for the square of the orbital angular momentum of the 
atom is 

M 2 = Ml + Ml + Mf 9-23 

At the same time we define the components of the total spin angular 
momentum by 

S x = Sal + Sa2 + * • • 

S y = Syl + Sy2 + * * * 9*24 

s* = S*i + S Z 2 + • * • 

For the' square of the total spin angular momentum the operator is 

S 2 = S|+ S 2 y + S 2 Z 9*25 
Finally there are the operators for the components of total angular 
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momentum and its square 

Jx = M x + S x 
J„ = M„ + Sy 
J. = M, + S, 

J 2 = J! + Jl + fi 


9-26 


Classically all these angular momenta are constant. The quantum 
analogue of this is that all these operators commute with the Hamilton- 
ian operator H. That S x , S„, S 2 , and S 2 commute with H is obvious, 
since H does not contain the spin coordinates. To show that the others 
commute with H we first consider M*. In polar coordinates this 
operator is 
d 

dip 2 


M, 


A/JL 

2 -iri \d(pi 


+ 



The only terms in H which involve the <p s are the parts of the Laplacian 

c} 2 d C 2 , 

containing — 5 > which commutes with — , and the terms in — . Now 

dip dip Tn 

— involves ipi and <p 2 only in the combination (ipi — ip 2 ), so that 
*"12 


(; 


iL + A + 

d<p 1 d<p2 


r 12 


d(ipi — ip 2 ) 
= 0 


/ e 2 \[ a(y>x — <92) , djfpi — <p 2 ) ] 

V12/1 d<Pi dip2 J 


Hence M z commutes with H. Since H is spherically symmetrical, 
M x and M„ must therefore also commute with H, as must M 2 . The J’s 
must likewise commute with H, for they are combinations of operators 
all of which commute with H. 

These operators do not, however, commute among themselves. 
Making use of equations 3-103, and the fact that the various angular 
momenta of one electron commute with those of any other electron, we 
find that 

m x m„'- M„M x = 

ATT 

M„M* - M*Mj, = ^ M x 
2w 

M,Mx - M X M* = £ My 
2 tt 


9-27 
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Similar equations hold between S*, S„, and S*, and between J z , ] y> and 
J*. Just as M 2 commutes with M*, M y , and M* for one electron, 
M 2 , S 2 , and J 2 commute with their components. Of course M 2 and its 
components commute with S 2 and its components. M 2 and S 2 also 
commute with J 2 and its components, but the components of M 2 and S 2 
do not commute with either J 2 or its components (except like components 
such as M*, S„ and J*). 

We can therefore select from among the angular momentum operators 
various sets which commute among themselves and also with H. Thus 
we might choose M 2 , S 2 , S*, and J z or M 2 , S 2 , J 2 , and J*. Now 

according to Theorem II of Chapter III, the eigenfunctions of H can be 
chosen so that they are simultaneously eigenfunctions of all the operators 
in any one of these sets. Moreover, if this is done the matrix compo- 
nents of H between eigenfunctions which have different eigenvalues for 
any one of these operators will vanish. Therefore, if we take a set of 
linear combinations of the D functions described above such that each 
combination is an eigenfunction of all the operators of a set of commut- 
ing angular momenta, then in equation 9*21 the Hi/s and Si/s will 
vanish except between those combinations which have the same eigen- 
values for all operators of the set. The appearance of 9*21 will then be 



9-21a 


if we arrange the combinations of D’s in such a way that the combina- 
tions with the same eigenvalues for the angular momenta are together. 
The shaded portion of the diagram represents non- vanishing H’s and S’s; 
the remainder of the determinant is occupied by zeros. Now a determi- 
nant such as the one in 9*21a is equal to the product of the shaded 
determinants along the diagonal, so that the roots of 9*21a are just the 
roots obtained by putting each of the shaded determinants successively 
equal to zero. The solution of 9*21 is thus reduced to the solution of a 
number of equations similar in form but of smaller degree. 

9f. Multiplet Structure. As long as we use the Hamiltonian 9*12, 
which does not contain the spin terms, it does not matter what set of 
angular momenta we use to reduce the secular equation 9*21 to the form 
9*21a. If the complete Hamiltonian were used, the only angular 
momenta which would commute with it would be J 2 and its components. 
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Although the other angular momenta do not commute with the exact 
Hamiltonian, it is found that usually M 2 and S 2 “ almost commute ” 
with H; that is, HM 2 — M 2 H is small. , On the other hand, M«, S*, etc., 
do not “ almost commute ” with the exact Hamiltonian. It would 
therefore seem logical to pick the set M 2 , S 2 , J 2 , and J* as the set of 
angular momenta to be used to reduce the secular equation. We shall 
presently see, however, that the eigenfunctions of the type described 
above are already eigenfunctions of M 2 and S*. For this reason it is 
more convenient to choose M 2 , S 2 , M*, and S* as our set of angular 
momenta as long as we are using the approximate Hamiltonian 9-13. 
If it is desired to find the eigenfunctions of J 2 and J z , these may be found 
later in terms of this set. 

Our next problem is then to combine the determinantal eigenfunc- 
tions, the D functions, into new trial eigenfunctions which are eigen- 
functions of M 2 , S 2 , M 2 , and S 2 . We shall first show that these trial 
eigenfunctions are already eigenfunctions of M 2 and S z . Let us take a 
general determinantal eigenfunction 

D = — . == \(p(nilimiSi\l)<p{n2l2^282\2) • • *| 9*28 

VN\ 

A typical term in the expansion of this determinant is 
<p(nilimi$i\l)<p(n 2 l2m2S2\2) * • • 

Operating on this term with M 2 , we find that the term M 2l gives the 

h 

same product multiplied by mi — , the term M* 2 the same product 

2w 

h 

multiplied by m 2 , and so on. The total result of operating with M* 
2 r 

h 

is the sum of these, or (mi + m 2 H ) — times the original product. 

27 r 

Since this holds for every term in the expansion of the determinant, it 
holds for the whole determinant, which is therefore an eigenfunction of 

M. with the eigenvalue Ml~, where M £ = m\ + m 2 H . By an 

exactly similar argument the D functions are eigenfunctions of S, with 

h 

the eigenvalue M s — , where M s = + 82 + • • •. For the list of 

2t 

possible D functions for the configuration (ls)(2p) given above, we 



MULTIPLET STRUCTURE 


187 


have, for example, 


M,Z>! = 

h n 

2tt Dx 

M ,Z> 6 = 

h n 
~2 

9-29 

S,Di = 

h n 

2t Di 

S 2 7> 6 = 

0 

9-30 


Let us now consider the effect of M 2 on the D’s. A typical term in the 
expansion of a D function is 

(p(nilimiSi\l)<p(n2hm2S2\2) * • * 9*31 

Now 

M 2 = (M xl + M x2 + • • -) 2 + (M„i + M y2 + • • -) 2 

+ (M z i + M 2 2 + * * *) 2 

= M? + Mi + M| + - • • 

+ 2M z iMz2 + 2M 2 iM 2 3 + 2M z2 M z3 + * * • 

+ (M z i + iM 2/ i)(M X 2 — iMj^) 

+ (M xl - iM vl )(M x2 + m v2 ) + ••• 

Using equations 3*127, 3*132, 3*133, we find 

M 2 [<p(nilimiSi\l)(p(n2l2ni2S2\2) • • •} 

h 2 

~ j~2 \Mh + 1) + h(h + 1) + • * • + 2m\rri2 + 2 m\m 3 

+ 2 m 2 m 3 + * • *][^(niiimiSi|l)^(n 2 ^2S2|2) • * •] 

+ [(k(h + 1) - n%i(mi + 1 ))(l 2 (h + 1) - m 2 (m 2 - l))] w 
[<p(nih, mi + 1, sx\l)<p(n 2 l 2y m 2 - 1, s 2 |2)?(n 3 Z 3 m 3 s 3 |3) • • •] 

+ [(h(h + 1) ~ mx(mi — 1 ) ) (J 2 (^2 + 1) — ^2(^2 + l))] w 
[<p(nili, m\ — 1, $i\l)<p(n 2 l 2 , m 2 + 1, s 2 |2)^(n 3 i3W 3 s 3 |3) * • *] 
H } 9*32 

If we represent by D{ (nilimiSi) (n 2 l 2 m 2 s 2 ) • • •} the D function whose 
principal diagonal is 9*31, we see that M 2 affects each term in its expan- 
sion in the same way, and hence that 

M 2 U{(n 1 Z 1 m 1 si)(n2Z 2 m2S2) • • •} 
k 2 

= 71 {Hlkili + 1 ) + 2"£m t m j ]D{(n 1 lim 1 si)(n 2 l 2 m2S2) • • •} 

w t i>j 

+ Z) [(k(U + 1) — mi(mi + 1 + 1) — my(m ; * — 1))]** 

»>/ 

Z){ (nxlxmisx) * * * (n^*, mi + 1, **) * * • (n y Zy, ra y - 1, s y ) * * *} } 9*33 
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Analogously 

S 2 D{(nilimi8i)(n2l2^2S2) '• • *} 
h 2 

- 71 ULkKW + 1) +21 s i Sj]D{(n 1 l 1 m 1 s 1 )(n 2 l2m2S2) ■ • •} 

47T i i>; 

+ E [(I + S,)(| - Si)(f - Sj) (I + s,)] H £>{ (nj^imiSi) • • • 

{riilitni, Si + 1) • • • ( njljrrij , sy — 1) • • •} } 9-34 

In the same way we find the effect of (M, ± iM„) and (S x ± iS y ) on 
the D functions to be 

(M* db tM^DKniZiWtxSi) • • •} = ~ Zlkik + 1) — m,(m f ± 1)] M 

Ztt i 

D{ (nilimiSi) • • • ( nilirrii ± 1, si) • • •} 9-35 

(S* ± ■■■} = - U(i ± *)(i =F «)]* 

^7T i 

D{ (ni^miSi) ■ • • (nikmi, s< ± 1) • • •} 9-36 

We thus see that the D’s are not themselves eigenfunctions of M 2 or S 2 . 
We can, however, form linear combinations of D’s which are such eigen- 
functions. First, it is obvious from 9-33 that M 2 does not change the 
value of Ml or M s . The combinations that we desire will therefore 
involve only D functions with the same M l and M s . We therefore put 

A = 'ZaD i 9-37 

i 

where D t - is one of a set of D functions arising from a given configuration 
and having the same M l and M s . We wish A to satisfy 

M 2 A = \A 9-38 

h 2 

and we know in advance from 3-127 that X is of the form L(L + 1) ti * 

47 r 

Since M l has an integral value, L must also be integral. Substituting 
9-37 into 9-38 gives 

M 2 ZciDi = XZciDi 

% % 

Multiplying through by each of the D* in turn gives a set of equations 
analogous to 7*49 

- XSy.-K- = 0 9-39 

i X 

where M% = D*M 2 Di dr. These equations are solved in the same 
way as 7*49. 
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Having found a set of A’s which are eigenfunctions of M 2 , M,, and 
S z , we repeat this same process with S 2 taking the place of M 2 . The 
resulting linear combinations of A’s are then eigenfunctions of M 2 , S 2 , 
Mj, and S z . We thus obtain a set of trial eigenfunctions B { which 
satisfy the relations 

h 2 


M 2 Bi = Li(Li + 1) -~2 Bi 
4ir 


940 


S 2 Bi = Si(Si + 1 )— 2 B. 


4ir 2 


M ,Bi = Mu — Bi 


941 

942 


StBi Mg ( Bi 

4TT 


943 


The final step is to use the B ’ s (taking only a set with the same L, S f Ml, 
and M s ) in equation 9-21 to determine the approximate energy levels 
and eigenfunctions of H. Before going into this last step in detail, we 
shall note some of the properties of the B functions. 

The L and S values of a B function (or an energy level whose eigen- 
function of H is a combination of B functions) is denoted by a capital 
letter showing the L value, the letter being chosen from the same code 
which is used in denoting the l value of an atomic orbital. The value 
of 2S + 1, the multiplicity of the term or group of levels with the same L 
value, is placed as a left-hand superscript. Thus a B function with 
L = 1, S = f is denoted by 4 P (quartet P); a B function with L = 0, 
S = 0 is denoted by X S (singlet S ). 

If any one of the B’s is picked out, a set of associated B’s can be found 
by applying the operators (M x ± iM. y ) and (S x =L iS y ) to the original B 
function until no new functions arise. All the members of such a set 
will have the same L and S values but will have all the possible M l 
and M s values consistent with the L and S values. Such a set is known 
as a “ term.” If there is more than one term of a given L and S value 
arising from a given configuration, the B ’ s arising from one of the B 
functions with a given Ml and M s upon multiplication by (M x =fc iM v ) 
and (S* db iSy) may not be the same as those found directly from per- 
turbation theory. These may, however, always be combined in such a 
way that the terms do not get mixed. In future work we shall assume 
that this has been done. 

Now let us consider a matrix component of H between any two B 


functions, J Bi'RB 2 dr, where B x and B 2 have the same value of L, S, 
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Ml, and Mg. Let B[ be the member of the term of Bi derived from B\ 
by the application of the operator (M x + fM v ), thus having an Ml 
value greater by 1 than B x has. Let B' 2 be the corresponding function 
derived from B 2 . Then from equations 3-133 

2ir 

B[ = - [L(L + 1) - M l (M l + 1)]-*(M. + iM v )Bi 

B' 2 =j [L(L + 1) - M l (M l + l)r H (M x + iM v )B 2 
The matrix element of H between B[ and B 2 is 
f B[*KB’ 2 dr = ^ [L(L + 1) - M L (M L + l)]" 1 

f '[(M* + + tM u )B 2 dr 

4-jr 2 r 

- -Jjr i ■ -r 1 J B*H(M X - tMy) (M x + tM v )B 2 dr 

- ^ [• • 'I" 1 / £*H(M 2 - M 2 - A M z )B 2 dr 


4 tc 

‘ HF 


[L(L + 1) - M l (M l + l)]- 1 ^^ + 1) - M 2 l - M l ) 


H Bo dr 


= j B*HB 2 dr 


This same relation holds for all the members of the term. Hence if we 
use any set of B’s belonging to a term, regardless of the M l and M s 
values, the matrix of H is the same, and therefore the approximate energy 
levels are the same. This is the reason that the M l and M s values of a 
B function are not specified. 

As an illustration of the construction of B functions let us consider 
the configuration (2p) 2 . The possible D functions are: 


Dx -2>{(21li)(21li)} 
Z) 2 «D{(211§)(210§)} 
D, -D{(211*)(210£)} 
D 4 *= D{ (21lJ)(210§ ) ( 
D 5 - D{(21lJ)(210f)I 


M l M s 
2 0 

1 1 

1 0 

1 0 

l - 1 
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M l 


D« = D{(21l£)(2lT*)} 

0 

1 

D 7 = D{(211§)(21IJ)} 

0 

0 

D 8 - Z){ (21li)(21lJ)} 

0 

0 

D» = D{(210§)(210|)} 

0 

0 

Dio = D{(21lJ)(2lT|)} 

0 

-1 

D u = D{(210i) (211^)1 

-1 

1 

D 12 = D{(210j)(2lTj)) 

-1 

0 

D 1S = D{ (210})(211§)1 

-1 

0 

D 14 =D{(210j)(2lI})l 

-1 

~1 

Dm - D{( 21l|)(21l})| 

— 2 

0 


Since D\ is the only D function with Ml = 2, M s = 0, it must be a 
B function. Applying M 2 and S 2 to it, and using 9*33 and 9*34, we find 

^ (2 + 2 + 2 )D X = 2(2 + 1) ^Z)i 

S 2 Dt = ~ [p - \2]D{ (21li)(21li)} 

+ (1)(1) (1)(1)Z>{ (21lJ)(21l£)} 

and since D{ (21lJ) (211^) } = -D t 

S 2 D X = ^ (£>i - Z>i) = 0(0 + 1 )D, 

we conclude, therefore, that, having L = 2, S = 0, Z>i is a B function 
belonging to a l D term. 

As an example of the procedure when there is more than one D func- 
tion with given M l and M s values, consider D7, Dg, and D g . Equa- 
tion 9*33 gives 

M^D 7 .2^D 7 + 2^D, 
HPD S .2^D S ~2^D, 

M^. = 2^D 7 -2^D,+4^B 9 


The secular equation for the eigenvalues is therefore 


2 -L(L + 1) 
0 
2 


0 

2 — L(L + 1) 


2 

-2 

4 — L(L + 1) 


= 0 


-2 
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which is satisfied by L = 0, 1, and 2. For L = 0, equations 9*39 become 

2c\ -f- 2 c 3 = 0 
2c 2 — 2c 3 = 0 
2ci — 2 c 2 H- 4c 3 = 0 

which gives C\ : c 2 : c 3 = —1:1:1. Hence we put 
A 7 = ( — Z) 7 + Z>8 + ^9) 


choosing the magnitudes of the c’s so that A 7 is normalized. It is easily 
verified that A 7 is an eigenfunction of M 2 with the eigenvalue zero. In 
a similar way we find for L = 1 

= \/2 

and for L = 2 

= (^7 £*8 + 21)9) 


We now calculate S 2 D 7 , S 2 D 8 > and S 2 D 9 by the use of equation 9*34. 

h 2 h 2 

h 2 h 2 

s 2 d 9 = 0 

Hence 

S 2 A 7 = 0; S 2 A a =1(1+1 ) A- + 8 ; S 2 A 9 = 0 


It follows that /I7, Ag, and A 9 are eigenfunctions of S 2 and may be used 
immediately as B functions. B 7 = A - belongs to a 1 S term; B 8 = A$ 
belongs to a 3 P term; and B 9 = A 9 belongs to a l D term. The complete 
list of B functions is found to be 


(*£) B x - D x 


i'D) B 9 = (D 7 -Ds + 2D 0 ) 


( 8 P) b 2 = d 2 


( 3 P) B 10 = 2>io 


( 8 P) B a = ~(D 3 +D,) 


( 3 P) B u = Bn 
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CD) B 4 = ± (D a - D 4 ) ( 3 P) Biz = — (D ia + D„) 

( 3 P) B 5 = D 6 (*D) 5x3 = ~ (D 12 - D 13 ) 

( 3 P) P 6 = D 6 ( 3 P) Pi4 = D w 

C-S) P 7 = ^ (“D 7 + P» 8 + D 9 ) CD) B u = D 1B 

( 3 P) B s = ^(D 7 + D 8 ) 

By operating with (M x zb iM v ) and (S x ± iS y ) on these functions we 
find that all the X D B functions belonging to a single term. In the same 
way the 3 P B functions form a term, and the single X S B function repre- 
sents a complete term. 

A configuration representing a closed shell has been seen to have only a 
single D function. This D function is always a B function of the type 
X S, having L = 0 and S = 0. If we compare the D functions arising 
from a configuration which includes one or more closed shells with the 
D functions arising from the configuration obtained by omitting the 
closed shells, we find that their properties are the same for operations 
with the operators M 2 , S 2 , M*, etc. The terms arising from these con- 
figurations are therefore the same. Hence, in the treatment of any 
atom, the closed shells may be omitted during the processes described 
in this section. Thus our example of the configuration (2 p) 2 serves as 
well for the configuration (l$) 2 (2s) 2 (2p) 2 of the carbon atom. 

9g. Calculation of the Energy Matrix. The matrix Hij of equa- 
tions 9*21 is formed from integrals of the type 

= J B*HBj dr 

Since the B’a are linear combinations of the D functions, these in- 
tegrals may be expressed in terms of the integrals 

H'ij = J DfBLDj dr 

Let us therefore consider this integral, taking as D* 

Di = -L \<p{a[\lMa' 2 \2) • • • | 


9*44 
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and as Dj 

a = ^k«ri i v«i 2) ---i 945 

where we have abbreviated the sets of quantum numbers n, l, m, s to 
the single symbol a, and N is the number of electrons in the configuration. 
If the determinants are expanded, 9-44 and 945 become 

Di = E • • -I 9-46 

= -== £ (-l)^(a''ilMa"|2) • • •} 9-47 

where P„ is an operator which permutes the variables 1, 2, 3, • • • among 
themselves, and v is the ordinal number of the permutations. H' {j may 
then be written 

= [^/ • • •}]* 

H[(-l) 1 'P,{ V ’«|lV(a2 , |2) • • •)] rfr 9-48 

Since all the variables are integrated out in each of the terms in this 
expression, we may interchange them in any way that we wish without 
changing the values of the integrals, provided that all the functions in 
any one integral have their variables changed in the same way. Sup- 
pose, therefore, that in each of the terms in 948 we change the variables 
by the permutation P 7 1 which restores the order of the variables in the 
first product to the normal order. Then 

Hi,- = -EE f [{*(a(|l)*>(<4|2) •••}]* 

H[(-l)’'P7 1 P.{p(o / 1 , |l)^(^ , l2).-- }]dr 9-49 

since H is not changed by permuting the variables. If we now sum 
over n we find that all the terms in the sum are equal, for the application 
of P~* to the whole set of permutations P„ gives the same set in a differ- 
ent order. Since there are N\ permutations P" 1 , we find 

Mai\lMa£\2) ■ ■ ■ }* 

H(-l)T,{*(ofll)*(ai'|2) • • • } dr 9-50 
The last product in this expression is of the form 
(-l)V(aj|lMo5|2)... 

where the aj are the same as the a" , or at most in some new order. 
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If we put 


h 2 2 Ze 2 e 2 

v 2 = U . _ 


8r*m 


V,y 


equation 9*50 becomes 

H' {j = LE(-l) 1 ' f {^(a{|l)^(^|2) • • • }U fc {^(ai|lV(4|2) • • • } dr 

k V V 

+ E E(-ir f {^(afllV*^) • • • } 

Kk y U 

V ki{<p(di\l)<p(a,2\2) * * * } dr 9*51 


Since U& is a function only of the coordinates of the fcth electron, we 
may now integrate over all the variables in the first of these integrals 
except those of the &th electron. The result is 


(- iyu(a' k , al) =(-!)■'/ <P*(4\k)U k <pK\k) dr 9-52 


if a\ = a'i for all i except i = k. Otherwise the integral will vanish 
because of the orthogonality of the orbitals. But if one of the P M makes 
all the a\ except one equal to the corresponding a' iy then any other permu- 
tation P M will destroy this equality, since no two a\ are equal. Hence 
only one term in the summation over v will remain. If we now sum 
over k , we find the total contribution of to H[j. This contribution is 

(a) Y,U{a f k1 a' k ) if Z)» == Dj. 

k 

( b ) U (a'i, a'/) if Dj differs from D{ only in that a' { ^ a". 

(c) If, by permuting the order of the orbitals in Dj by a permutation 
P, Dj may be made to satisfy the conditions of (a) or of (6), then the 
contribution is that of (a} or (6) times +1 if the permutation is even, 
or times — 1 if the permutation is odd. 

( d ) If two or more orbitals of Dj are different from those of Z) t -, there 
is no contribution. 


Rule (a) is a consequence of the orthogonality of the orbitals and the 
fact that no term in H involves the coordinates of more than two elec- 
trons. In the second integral of 9*51, we integrate first over all the 
coordinates except those of the fcth and Zth electrons. This integration 
gives zero unless every a[ = a\ except for i — k and i — Z, when it 
becomes 

(-iyV(aLal; <4, a\) 

= (- 1 )- dr 9-53 
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In considering the various permutations P„ we see that, if any P„ give 
a set of a\ such that a\ = a[ except for i = k } l, then any other permuta 
tion will destroy this property except the permutation which differ 
from this P„ only in the interchange of l and k. The sum over v there 
fore gives only two terms. Summation over k and l now leads to th 
following rules for determining the contribution of V&j to Hy. This con 
tribution is 

(a) E { V (4, a(; a' k , a[) - V(a' k , a[; a[, a k )} if Dj = Z>,. 

Kk 

(b) E { V (a' k , a' m ; a' k , a",) - V (a' k , a’ m ; a", a*)} if Dj differs from D 

k 

only in that a' m a". 

ip) a'; o!^ y a") - a'; a", a") if Dy differs from A ii 

that a' m 7 * a" and a' ^ a" but is otherwise the same. 

(d) That of (a), (6), or (c), if, by permuting the a " , Dj can be mad< 
to obey the conditions of (a), ( b ), or (c), times +1 if the permutation if 
even, or times — 1 if the permutation is odd. 

( e ) Zero if Dj contains three <p’s different from those of D,. 

Since neither U& nor depends on the spin coordinates of the orbi- 
tals, these coordinates may be integrated out of the expressions for JJ) 
and Vm. Using the orthogonality properties of a and /8 we see at once 
that i7(ai, a 2 ) = 0 if si 5^ 5 2 and that V (a^ a 2 ; a 3 , a 4 ) = 0 unless 
Si = s 3 and s 2 = s 4 . 

As the operators U& do not depend on the 6 or <p coordinates of the 
electrons, these may also be integrated out of 9-52. If we put 

v (nlms\i) = R(nl\i)Y(lm\i) (J“J) 9-54 

i(p)J 

then 

U(nilimi8i ; tt 2 i2m 2 s 2 ) 

= f R* (mh 1 1 ) ViR(n 2 l 2 1 1 ) F * {hmi 1 1 ) F (hm 2 \ 1 ) dr 

= I (n\l\ ; n 2 l 2 ) § Y* (hmi 1 1 ) F (J 2 m 2 | 1 ) sin 9 dd dtp S (si, s 2 ) 9-55 

where 

I (mli ; n 2 h) = f R*(n 1 h\l)V 1 R(n 2 l 2 \iydr 9-56 


and where 5(sx, 82)= 1 if si = S2 ; = 0 otherwise, 
orthogonality of the F’s, the integral 


f F*(iim 1 |l)F(f 2 m 2 |l) 


sin 6 dd d<p 


Because of the 



CALCULATION OF ENERGY MATRIX 


147 


vanishes unless l\ = k and mi = ra 2 , in which case it is unity. Hence 
U(nilimiSi; n 2 l 2 m 2 s 2 ) 

= 112 I 2 ) &(h, k) K m i) m 2 ) 5(«x, s 2 ) 9*57 


Since Vjti involves — , which depends on the variables 9 and <p f we 
Tkl 

cannot integrate out these variables in V(aia 2y a 3 a 4 ) directly. If, 

however, we use the expansion of — as given in Appendix V, we find, 

I'M 

after integrating out the spins, 

V(nilimiSi\ n 2 l 2 m 2 S 2 ; n 3 l 3 m 3 s 3 ; n 4 l 4 m 4 s 4 ) 

\ir 


l^n 21 + 


I {// DR*(n 2 l 2 \2) fi(«8fe|D 

R{n 4 l i \l)i\r^ dr 1 dr 2 X 
S S sin 0, d0, dipi X 


j' J' Y*(l 2 m 2 \ 02 >P 2 )Y(h'm i \d 2 <P 2 ) Y (Zm| 02 ^ 2 ) sin 02 d6 2 d<p 2 ) X 

^(^ 1 ^ 3 ) ^(^ 2 ^ 4 ) 9*58 


The integrals vanish unless m = m 3 — m\ — m 2 — m 4 , so that the 
summation over m is reduced to a single term. The remaining integral 
over the angles is of the form of a product of two integrals which we 
denote by 


cu(l im; km 2 ) 


4tt 


2fc -f* 


-jJ J* Y*(limi\d<p) Y ( lm\d(p ) Y (l 2 m 2 \9<p) sin 9 d$ d<p 


where m = mi — m 2 . Now unless mi + m 2 = m 3 + m 4 in equa- 
tion 9*58 it is impossible to find an m such that m — m 3 — m\ = 
m 2 — m 4 . Hence 7(ni • • •) = 0 unless mi + m 2 = m 3 + m 4 , and of 
course $1 = S 3 ; s 2 = s 4 . When these conditions are fulfilled 


V(nilimx8x; n 2 km 2 s 2 ; n 3 l 3 m 3 s 3 ; n 4 Z 4 m 4 s 4 ) 

= mci(km3;limi)ci(l 2 m 2 ; l 4 m 4 )Ri(nili, n 2 Z 2 ; W 3 Z 3 , n 4 i 4 ) 9*59 

1 


where 

Riimh, • • • nth) = ff R* (nih\l)R* (n2h\2) ^ 

R(n a l 3 \l)R(n 4 h\2)iiti dr, dr a 960 
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The integrals I(nik; n 2 l 2 ) and Ri(n x l i, n 2 l 2 ; ^ 3 , w 4 i 4 ) cannot be 
evaluated unless the forms of the <p’s are known, so that we have reduced 
the calculation of H # to its simplest form. 

To summarize the procedure for calculating H-, there are four steps: 

1. We are given two D functions, D(a[,a 2} a£,- • *)and D(a!i , 0 ," • ••). 

We first permute the sets of quantum numbers a", a 2t • • • until as 
many sets a'/ as possible are identical with the corresponding sets a*. 
If the permutation which effects this arrangement is an odd permutation, 
we multiply the result by —1. 

2. We now express H' {j in terms of the U (a», ay) and V (a*, aj; a*, aj). 
There are four cases: 

(a) After the permutation, the two D functions are identical. Then 

By = YLU{a ky a k ) + lL{V(a k , ay, a k , a{) — V(a k , a*; ai, a k )\ 9*61 

k k<l 

( b ) After the permutation the two D functions are identical except 
for one set of quantum numbers, say the fcth, so that a k 9 ^ a k . Then 

U* y = U{a kl <z") + E{7(ai[, a t ; a' k ', a*) - V{a k) a t ; a h a' k ')} 9*62 

l*k 

(c) After the permutation the two D functions differ in two sets of 
quantum numbers, say a k = a " and a[ ^ a". Then 

H'ij = V(a ' k , a[\ a’ k a[ f ) - V(a' k) a' t ; a'/, a' k ’) 9*63 

(d) After the permutation, the two D functions differ in more than two 
sets of quantum numbers. The contribution to is then zero. 

3. The Ufa, ay) and F(a*ay; a k ai) are expressed in terms of the 

integrals nyZy) and RiiriiU , nyZy ; n k l k , niU). We have 

U (didj) = I (fljlij Tljlj ) 5(Zj, lj) 8(jtl{ } 7flj) <5 ($*, Sj ) 

V(a x a 2 ; a 3 a 4 ) = Zcf(Z 3 ra 3 , limi)ci(l 2 m 2f Z 4 m 4 ) 

1 

Ri(n\h, tyfo) w 3 Z 3 ; ^ 4 Z 4 ) &(si, S3) 8(s 2 , $4) 

8 (mi + ra 2 , m 3 + ra 4 ) 9*64 

4. The integrals I (niU; nyZy) and Z2(n t Zi • * •) are evaluated and the 
results for Hij are substituted in 9*21. The integrals Sij are equal to 
8{j, since the D’ s and B \ s are orthogonal. 

In most of the calculations of this type it will be necessary to calculate 
only diagonal elements for the energy matrix. This will be the matrix 
element between one D function and itself, so that we have case 2a. 
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The energy matrix in this case may be written quite simply as 

Hu = E/(«* <**) + ZJ(k, l ) - ZK(k, l ) 9-65 

k k<l k<l 

where the “ coulombic ” integral J (k, l ) is defined as 

J(k, l ) = y(a t oi; a*ai) = iy(a*, ai)F l {a k , ai ) 9-66 

i 

and the “ exchange ” integral Z) is defined as 

K{k, l ) = F(a*, af, a h a k ) = 5(sjb, s{) 52b\a k , ai)G l (a k , a t ) 9-67 

i 

where 

a*(a fc , aj) - c*(a*, a k )c t {a h ai) 
b (( x&, ai) — Ci (ai, ajc)ci(ai f ai) 

F l (a kr a{) = Ri(a k ai ; a k ai) 

G l (a k} ai) = Ri(a k ai ; aia k ) 

In Tables 9*1 and 9*2 are listed the values of the a’s amd b’s for states 
involving s and p electrons. Values of the c r s, as well as more complete 
tables for the a ’ s and b’s , may be found in Condon and Shortley. 6 

As an example of these rules, let us calculate the energies of the terms 
arising from the configuration (2p) 2 . Referring to the list of B func- 
tions for this configuration, we see that the secular equation has the form 


- 0 9*68 


including eight determinants of the first order, two of the second, and 
one of the third. Now Z>i is a B function corresponding to the X D term, 
so that the energy of this term is immediately given by the root of the 
determinant in the upper left-hand corner as 

E( l D) = Hu 9*69 

Similarly, the energy of the 3 P term is immediately given as 

E( B P) = H' 22 9*70 

6 E. U. Condon and G. H. Shortley, Theory oj Atomic Spectra , Cambridge Uni- 
versity Press, p. 178, 1935. 
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TABLE 91 


a\l,m ; 


Electron 

Configuration 

Z 

m 

V 

m 

a 0 

a* 

88 

0 

0 

0 

0 

1 

0 

sp 

0 

0 

l 

±1 

1 

0 


0 

0 

l 

0 

1 

0 

pp 

1 

±1 

i 

±1 

1 

ih 


1 

±1 

l 

0 

1 

“THT 


1 

0 

l 

0 

1 

inr 


The remaining a’s are zero for s and p electrons. 

TABLE 9-2 
b l (l, m ; m) 

l m V m b° b l b 1 

0 0 0 0 1 0 0 

0 0 1 =tl 0 £ 0 

0 0 1 0 0 ^0 

1 +1 1 +1 1 0 A 

i -i i -i i o 

1 rfcl 1 0 0 0 

1 +11 -1 0 0 A 

1 0 1 0 10^ 

The energy of the *£ term will be given by one of the roots of the third- 
order determinant. The other two roots of this determinant correspond 
to the l D and 3 P terms. Now the sum of the roots of a determinant is 
equal to the sum of the terms on the principal diagonal, so that the energy 
of the l S term is given as 

E( l S) = H' 17 + Hi g + Hi* - H' n - H f 22 9*71 

Referring to the list of D functions, we readily find, by means of equa- 
tions 9-65-9*67 and Tables 9*1 and 9*2 that the pertinent matrix com- 
ponents of H are 

H' n - 2/(2, 1; 2, 1) + F°( 2, 1; 2, 1) + 2, 1; 2, 1) 

= 2/(2, 1; 2, 1) + F° (2, 1; 2, 1) - -&F\ 2, 1; 2, 1) 

-^(2,1; 2,1) 

= 2/(2, 1; 2, 1) + F° (2, 1; 2, 1) + ^F 2 (2, 1; 2, 1) 

H«> - 2/(2, 1; 2, 1) + F°(2, 1; 2, 1) + *F 2 (2, 1; 2, 1) 

H' M = 2/(2, 1; 2, 1) + F°( 2, 1; 2, 1) + ^F 2 (2, 1; 2, 1) 




FINE STRUCTURE 


151 


from which we calculate the energies for the various terms to be 
E( l D) = 2/(2, 1; 2, 1) + F°( 2, 1; 2, 1) + ^F 2 ( 2, 1; 2, 1) 9-72 

F( 3 P) = 2/(2, 1; 2, 1) + F°(2, 1; 2, 1) - ^F 2 (2, 1; 2, 1) 

- *#(2, 1; 2, 1) 9-73 

ECS) = 2/(2, 1; 2, 1) + F°(2, 1; 2, 1) + ^F 2 ( 2, 1; 2, 1) 

+ ^ 2 (2 ? 1; 2, 1) 974 

It is, of course, impossible to make an exact comparison of theory with 
experiment without evaluating the integrals for a particular choice of 
trial eigenfunctions. It is, however, possible to make some qualitative 
calculations. As the integrals F and G are positive quantities, we see 
that the orders of the levels will be 3 P, 1 D, 1 S, with the 3 P the lowest. 
This is in accord with Hund’s rule in atomic spectroscopy: Of the terms 
arising from a given electron configuration the term with the highest 
multiplicity will lie lowest; of terms with the same multiplicity, the one 
with the highest L value will lie lowest. For equivalent electrons, 
F*(nl, nl) = G l (nl, nl). For the electron configuration (np) 2 , our 
results state that 

ECS) - Em = -&F 2 + &G 2 = 3 
ECD)-E( 3 P ) &F 2 + 1 ? S G 2 2 

For the atoms C, Si, Ge, Sn, all having this electron configuration, 
Condon and Shortley give the values for this ratio as 1 . 13, 1 .48, 1 .50, 1 .39, 
respectively, although for certain other atoms having this same con- 
figuration the agreement is less satisfactory. Although this method of 
approach by no means gives exact quantitative results, it does give the 
number and type of terms correctly, and usually the relative positions 
of these terms are qualitatively correct. 

9h. Fine Structure. To the approximation we have been consider- 
ing, we have seen that the various B functions of a term all have the 
same energy. If, however, we add to our Hamiltonian operator the 
terms representing the interaction between the magnetic dipoles associ- 
ated with the motions of the electrons in their orbits and the magnetic 
dipoles associated with the spin of the electron, this is no longer true. 

We can determine the general form of this “ spin-orbit ” interaction 
by the following argument. Consider an electron moving about a 
nucleus of charge Z\e\ at a distance r. The electric field at the electron 
Z\e\ 

will be E = — y* r. If the electron is moving with a velocity v, it will 
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be subjected to the action of a magnetic field of strength 



Z\e\ 


r*v 


cr 


Ze 


mcr 


M 


9-76 


where M is the angular momentum of the electron. The classical 
energy of a dipole in a magnetic field is — p, • H, so that the energy cor- 
rection should be 




Z\t 


M * 


mcr 




A more careful analysis, 7 including a relativity correction, shows that this 


expression should be reduced by a factor Since \l x = — — S x , etc., 

me 

the spin-orbit interaction thus contributes a term to the Hamiltonian 

9-77 


I Ze 2 

H' = L ^-F2 — (M«-S XI + M yi s yi + M„-S fi ) 


i 2 m z c l rf 


Ze 2 


For the more general case of a non-coulomb field, the term — will be 

U 

replaced by - — — - , so that we obtain for the spin-orbit interaction 
Ti dr { 

operator the result 
H' = £ 


l fl dV(n) 


7 2m 2 c 2 In dri 


(MxtS X t 4 ~ + M zt S Z i) 9*78 


This operator commutes with J 2 and J z> but not with M 2 , S 2 or their 
components. The eigenfunctions of the complete Hamiltonian will 
not therefore be eigenfunctions of M 2 , S 2 , M z , or S*. If, however, the 
term H 7 is small compared to the other terms in the Hamiltonian, the 
energy levels of the complete Hamiltonian will approximate those of the 
approximate Hamiltonian which we have previously considered. There- 
fore each term will break up into a number of energy levels whose ener- 
gies will be close to that of the unperturbed term. The eigenfunctions 
corresponding to these energy levels will be approximately represented 
by linear combinations of the B functions corresponding to the unper- 
turbed terms. 

Since J 2 and J z commute with the complete Hamiltonian, the final 
eigenfunctions of H must also be eigenfunctions of those operators with 

h 2 0 h 

eigenvalues of the form J{J + 1 ) 7-9 for y and M — for J*. If we 

47 T 2 tt 


7 E. Kemble, Fundamental Principles of Quantum Mechanics , p. 502, McGraw- 
Hill Book Company, 1937. 
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'epresent the final eigenfunctions of H by C(J, M) t we must therefore 
lave 

] 2 C(J, M) = J {J + 1) C(J, M) 9-79 

47 r 

J Z C{J, M) = M C(J, M) 9-80 

Since J z = M* + S 2 , the B functions involved in C(J, M ) must have 
M l + Ms = M, since there is but one B function with a given Ml 
md Ms value. We may label the B functions by their Ml and Ms 
^alue as B(M l , Ms), so that the C’s must have the form 

C(J, M) = EHMl, M s )B{M Ly M s ) 9*81 

vhere Jc(Ml, Ms) is a constant. 

The values of the constants k are most easily found by means of the 
>perators 

J x db ijy = M x dt lNL v + S x zb iSy 

A r e have 

;jx =t tJv)C(J, M) = ZHMl, Ms)(M x ± iMy)B(M L , Ms) 

+ ZHM L , M s )(S x ± iS v )B(M l, M s ) 

rom which 

J(J + 1) - M(M ± 1)] H C(J, M ± 1) 

= ZKMl, M s )[L(L + 1) - M l {M l ± 1)) H B(M l ± 1, M s ) 

+ 2 'MMl, M s )[S{S + 1) - M S (M S ± 1)] h B(M l , M s ± 1) 9-82 

md therefore, from a knowledge of C(J, M), equation 9-82 gives 
J(J, M ± 1) and hence all C’s with the same J value. 

The highest value of M is M = L + S, and there is only one B func- 
,ion with this value of Ml + M s . Moreover, there is no B function 
vith a greater value of M . Hence if J = L + S 

C(J, J) = B(L, S ) 

md by 9-82 all the C(J, M) with J = L + S can be found. In particu- 
ar, C(J, J — 1) is found as a linear combination of B(L — 1, S) and 
3(L, S — 1). Now C(J r , J 1 ), where J' = L + S — 1, must also be a 
inear combination of the same two B functions; in addition it must be 
>rthogonal to C(J, J — 1). We therefore put 

C(J', J ') = k[B{L - 1, 8) + k' 2 B(L, S-l) 

Lf 

C(J, J - 1) = k x B(L - 1, S) + k 2 B(L, S-l ) 
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the orthogonality condition is, since the 5’s are normalized and 
orthogonal, 

kX + k 2 k 2 = 0 9-83 

which, with the condition k[ 2 + k' 2 = 1, determines k[ and k 2 . Having 
determined C(J' t J') we now determine all the C(j', M) by 9-82. 
We may now.determine C (J", J"), where J" = L + S — 2, by finding 
that combination of B(L — 2, S), B(L — 1, S — 1), and B(L, S — 2) 
which is orthogonal to C(J, J — 2) and C(J', J' — 1). This in turn 
gives all the C(J", M). This process is continued until 2S + 1 or 
2L + 1, whichever is smaller, sets of functions have been found, with the 
J values \L + <S|, \L + S| — 1 • • • \L — <S|. If we attempt to carry 
the process beyond this point, we find that the number of B functions 
corresponding to M — J ln+1) is only 2x + 1, where x is the smaller of 
L and S, and we already have (2x + 1) C(J (n) , M)’s. It is therefore 
inn possible to find a new linear combination of B’s which is orthogonal 
to the linear combinations already found. 

To illustrate this process, let us consider a 2 P term, so that L = 1, 
S = j. The B functions are 5(1, §), 5(1, -§), 5(0, £), 5(0, -£), 
5(— 1, J), 5(— 1, — \ ). The process therefore starts with C(§, -|) = 
5(1, f). Equation 9-82 (with the lower signs) then gives 

C(f, f) = 5(1, |) 

C(l \) = Vf6(0, h) + Vj5(l, -\) 

C( f, -i) = V^5(- 1, |) + V|5(0, -§) 

C(§, -$) = 5( — 1, -|) 

We now put 

C(h I) = fci5(0, §) + k 2 B(l, -i) 
and 9-83 gives for orthogonality to C(§, j) 

\Zfki + y/$k 2 = 0; k 2 — — \/2ki 

The values ki = and k 2 = — Vf satisfy this as well as the relation 
Jfcf + k 2 2 = 1. Hence 

C(h, h) = V? 5(0, |) - Vi 5(1, - J) 

If we apply 9*82 again, we obtain 

C(l -h) = Vf B( — 1, *) - 5(0, -i) 
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We have now formed six C functions out of the six B functions, and no 
more linear combinations can be made. The process is therefore 
complete. 

The energy levels derived from the C functions are dependent only 
on J and are the same for C functions with the same J but different M. 
Thus the four C functions C(f, -f), C(f, |), C(f, —5), C(f, — f) have 
one energy, and the two C(|, \) and C(J, — J) have a different energy. 
The energy levels are denoted by adding the value of J as a subscript to 
the term symbol. Thus the two energy levels arising from the 2 P term 
are denoted by 2 P H and 2 P 

In retrospect, the energy levels of an atom may be thought of as aris- 
ing from the various electron configurations. If the interactions 
between the electrons were completely neglected, the various eigen- 
functions arising from the configuration would have the same energy. 
The electrostatic interactions split the configurations into terms, denoted 
by their L and S values, with different energies. Finally, the spin-orbit 
interaction splits each term into 2L 4* 1 or 2S + 1 energy levels, dis- 
tinguished by their J values, which run from J L + S | to \L — S|. 
Each of these energy levels is still degenerate, having 2 J + 1 eigen- 
functions corresponding to M values which run from J to — J. As we 
shall see in the next section, this degeneracy can be removed by the 
application of an external magnetic field. 

In many atoms, particularly atoms of high atomic number and with 
almost complete shells, the spin-orbit interaction may become of more 
importance than the electrostatic interaction. In considering these 
atoms, it is a better approximation to neglect the electrostatic energy 
at first and consider the spin-orbit interaction alone for the first approxi- 
mation. We shall not treat this case here. Atoms of this type are said 
to have j-j coupling, as contrasted with the type we have been consider- 
ing, which are said to have L-S , or Russell-Saunders, coupling. 

9i. The Vector Model of the Atom. The results obtained above 
form a basis for a discussion of the “ vector model ” of the atom. This 
model is of value in visualizing some of the energy relations involved, as 
well as in giving a simple method of calculating the terms arising from a 
given electron configuration. From the configuration (np) 2 , we saw 
that one obtained the terms 1 Z), 3 jP, l S; that is, we had possible L 
values of 2, 1, and 0, and possible S values of 1 and 0. Since we had 
h = 1, h = 1, where l\ and l 2 are the l values of the two electrons, we 
see that the above L values are obtained by adding h and I 2 vectorially, 
if we require the resultant vectors to differ in length by steps of unity. 
In the same manner the above S values can be obtained by vector addi- 
tion of |*i | = | and | $2 1 “ §• The resulting L and S values cannot be 
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combined arbitrarily, since the exclusion principle must be taken into 
account. The possible terms can be determined by the following 
scheme. 8 We write as a table all the possible values of Ml which can 
be formed by combination of m\ and m 2 . 



The table shows that the values of M L are 

2 10 - 1-2 
1 0 -1 
0 

which are just the values required to form one D , one P, and one 5 term. 
The spins can be combined to form either 5 = 0 or 5 = 1. For 5=1, 
both electrons have the same spin quantum number, so that they must 
differ in their values of m. We cannot, therefore, combine any of the M l 
values from the diagonal of the above table with 5=1. Also, we can 
use only the Ml values from one side of the diagonal, as those on the 
other side merely correspond to a different numbering of the electrons. 
When 5 = 1, we are thus limited to the Ml values 1, 0, — 1, so that we 
have a 3 P term. When 5=0, the electrons differ in their spin quantum 
numbers and there is no restriction on the values of Ml which may be 
combined with this value of 5. As the set of values 1,0, —1 has 
already been used to form the 3 P term, we thus have the sets of Ml 
values 2, 1,0, —1, —2, and 0 to combine with 5 = 0, giving l D and l S 
terms. 

For the configuration (ns) 2 , there is, of course, only the one term l S. 
For the configuration (nd) 2 , our table for the M l values is 


*!= 2 

f»x= 2 1 0-1-2 



8 G. Breit, Pkys . Rev., 28 , 334 (1926). 
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For S = 1, we are again limited to the M l values from one side of the 
diagonal, which are sufficient to give 3 P and 3 P terms. For S — 0, 
we have the remaining sets of Ml values, which are sufficient to give 
x O , l D, X S terms. This method can be extended to more than two 
equivalent electrons, although the calculations are slightly more compli- 
cated. The possible terms that can arise from combinations of equiva- 
lent s and p electrons are 


(ns) 

2 S 



(ns) 2 

l S 



(np) 

2p 



(np) 2 

'S 

3p 

l D 

(np) 3 

*s 

2 d 

2p 

(np)* 

's 

Zp 

l D 

(np) 5 

2p 



(np) 5 

l s 




For such electron configurations as (rip, mp), the exclusion principle 
is already satisfied by the differing values of the principal quantum 
number. There is thus no restriction on the combinations of Ml and S 9 
so that this electron configuration gives the terms 3 D, 3 P, 3 S , 1 D ) 1 P J X S. 
Similarly, for the combination (np, nd) } the exclusion principle is satis- 
fied by the differing l values. From this configuration we thus obtain 
the terms 3 P, 3 D } 3 P, 3 S , X F 1 X D, X P, X S. 

* h 

For a given term we have an electronic angular momentum L* — » 


where |L*| = VL(L + 1); and a spin angular momentum S* — » 

2t 

where |S*| = vS(S + 1). As we have seen in the previous section, 
these angular momenta combine to give a total angular momentum 


J* — , where |j*| = Vj(j + 1), and where J has the values from 
2w 


\L + /S| to \L — S|, the J values differing by unity. These are just the 
J values we would obtain if we imagined the orbital and spin angular 
momenta to combine vectorially to give the resultant angular momen- 
tum, the resultant values being limited by the quantum principle. As 
we have seen, a magnetic moment is associated with the orbital angular 
momentum as well as with the spin angular momentum. Also, the 
motion of the electrons in their orbits produces a magnetic field. Ac- , 
cording to equation 9*78, we see that the spin-orbit interaction will be of 
the form 


= AL* • S* 


9-84 
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where A will be some function of L and S and the electronic state. Since 

L . ... irp - ri» - 1st 


this becomes 

AE = ^{J(J + l) - L(L + 1) - S(S + 1)} 9-86 

z 

The magnetic field produced by the orbital motion is in the same direc- 
tion as the orbital angular momentum. The state of lowest energy will 
be that in which the magnetic moment of the electron spin is parallel 
to the field. Now the magnetic moments of both orbital and spin 
angular momenta are in the direction opposite to that of the associated 
angular momenta, owing to the negative charge of the electron. The 
state of lowest energy will thus be that in which L and S are in opposite 
directions. The lowest state is therefore that with the lowest value of J. 
[In the list of terms arising from configurations of equivalent electrons 
as given above, it will be noted that ( np ) 6 gives the same terms as (np); 
that is, a closed shell minus one electron gives terms similar to those from 
the same shell containing only one electron. In terms arising from 
these “ almost-closed shells ” the order of the states is reversed, the 
lowest state being that with the highest J value. For example, the 
halogens, with the (np) 5 configuration, have 2 P H as the ground state.] 

Since L and S are the same for all members of a multiplet, the sepa- 
ration between the state with quantum number J and that with J + 1 is 

^{{J + l)(/ + 2) - J(J + Dl = A(/ + l) 9-86 

that is, it is proportional to the larger J value. For example, the states 
®P 0 , 3 Pi, 3 P 2 have relative separations 1 : 2; the states 4 Z) W , 4 Z), 4> 
4 Dj$, 4 Dj 4 have relative separations 3:5:7. This relation, the Lande 
interval rule, is closely obeyed in atoms with Russell-Saunders coupling. 

As stated above, a state with angular momentum quantum number J 
is (2J -f l)-fold degenerate, there being this number of C functions 
with the same value of J. Returning to our example of the terms 
arising from the electron configuration (np) 2 , we have 1 S 0 , non-degener- 
ate; 1 Z> 2 , 8 P 2 > five-fold degenerate; 8 Pj, three-fold degenerate; 3 Po, 
non-degenerate. This accounts for the fifteen states corresponding to 
the possible linear combinations of the fifteen D functions for this con- 
figuration. If the atom is now placed in a magnetic field, the energy will 
depend on M as well as on J, and this degeneracy will be removed. If 
ps* is the magnetic moment associated with a state with quantum 
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number J, the energy in a magnetic field will be 

E(J t M) = Eq(J) — | ij* • H 9*87 

where E 0 (J)ia the energy in the absence of the field. As we have noted 
previously, the magnetic moment associated with the orbital motion is 




eh 

4:7rmc 


that associated with spin angular momentum is 


(ts* = 2S* 


eh 

4t7TIC 


The magnetic moment associated with the total angular momentum is 
therefore 




- jpj(k*l cos (L* J*) + || 1 5 .| cos (S* J*)) 

■wit &h 


- oJ* 


9-88 


where 


g =* I + 


/(/ + 1) + ,S0S + 1) -L(L + 1) 
2 J(J + 1) 


is the “ Lande g factor/' The energy levels in the presence of a mag- 
netic field are therefore 


E(J, M) = E 0 (J) - g 
= E 0 (J) + g 


eh 

4tvic 
4 rmc 


J* • H 

HM, M = J, J 


J 9-89 


Each level with quantum number J is thus split into 2J + 1 equally 
spaced levels by the magnetic field, the lowest level being that with 
M *= —J. In Figure 9-1 we have plotted the levels arising from the 
configuration (np) 2 for the various degrees of approximation. For each 
degree of approximation, the relative separations within a given multi- 
plet are correct; the other separations are not to scale. 

9j. Selection Rules for Complex Atoms. As the field of atomic 
spectroscopy lies to a certain extent outside the scope of this book, we 
shall not treat the selection rules for the allowed transitions in complex 
atoms in any detail but shall merely summarize some of the more impor- 
tant results. According to the general theory of Chapter VIII, a transi- 
tion between two states a and b may take place with the emission or 


absorption of electric dipole radiation only if the 


integral 
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different from zero. Now any atom has spherical symmetry; the 
Hamiltonian operator is unchanged if the position coordinates of every 
electron are subjected to an inversion through the origin of coordinates. 
Because of this spherical symmetry, the product must be unchanged 






No electronic Electrostatic Spin-orbit Magnetic 

interaction repulsion interaction field 

included included present 


Fig. 9*1. Schematic energy levels from the electron configuration (np) 2 . 


if every coordinate is replaced by its inverse. Upon inversion, therefore, 
the function must either remain unchanged or must change only in 
sign, so that the atomic wave functions can be classed as even or odd 
with respect to inversion. The vector r is, of course, an odd function 
with respect to this operation. Now, if and ipb are both even func- 
tions or both odd functions, the product will be an odd function. 
If we carry out an inversion, the integrand will change sign, its magni- 
tude, however, remaining unchanged. Thus for every contribution to 
the integral from a particular volume element dii there will be a con- 
tribution of equal magnitude but opposite sign from the inverse volume 
element; the integral will be identically zero. We have thus derived the 
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selection rule for dipole radiation: only transitions between an even 
state and an odd state are allowed. This selection rule is perfectly 
general, depending as it does only on the symmetry properties of an 
atom and not on the particular set of wave functions which we use to 
give an approximate description of the electronic state of the atom. The 
following selection rules are less strict but hold approximately for many 
atoms. 

If we consider that the states of the atom can be described by wave 
functions which are the proper linear combinations of products of one- 
electron orbitals which are solutions of a central field problem, we can 
make the following statements. Many transitions can be described by 
assuming that only one electron is involved. The angular portions of 
the orbitals are then identical with those of the hydrogen atom, the 
selection rule will therefore be the same, so for a one-electron transition, 
to this approximation, we have the rule Al = ±1. For example, a 
transition of the type (np) 2 —> (np, md) is allowed; transitions of the 
type (np) 2 — > (np, rap) or (np) 2 — > (np, mf) are forbidden. As may 
readily be seen, either from the expressions for the hydrogenlike wave 
functions or from the nature of the spherical harmonics, these functions 
are even for l even and odd for l odd. To the approximation we are here 
considering, an atomic wave function is therefore even or odd according 
as is even or odd. If we have a transition involving two electrons 

i 

simultaneously, then the general theorem that transitions are allowed 
only between even and odd states tells us that Al must be even for one 
electron and odd for the other. Since we have Al = =bl for a one- 
electron transition, for a two-electron transition we would have 
Ali = dbl; AU == 0, ±2. 

Of more general interest are the selection rules for S, L, and J, as 
these quantum numbers describe an atomic state with greater accuracy. 
To the approximation that we neglect spin in the Hamiltonian operator, 
the spin wave functions are independent of the coordinate wave func- 
tions. The dipole moment integral will vanish because of the orthogo- 
nality of the spin functions unless the spin quantum numbers match in 
the initial and final states. To this approximation, we thus have the 
selection rule AS = 0; that is, only transitions between terms of the 
same multiplicity are allowed. The selection rules for L and J cannot 
be derived so simply; the results are 9 

AL = 0, dbl; AJ = 0, ±1 9*90 

9 E. Kemble, Fundamental Principles of Quantum Mechanics , p. 543, McGraw- 
Hill Book Company, 1937. 
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In the presence of a magnetic field, the energy depends also on M. The 
observed phenomena in the Zeeman effect are explained by the selection 
rule for M 


AM = 0, dr 1 


9-91 


this selection rule being derivable in the same manner as the selection 
rule for m in the theory of the hydrogen atom. The selection rule 
AM = 0 corresponds to the non-vanishing of the z component of the 
dipole moment integral; the light emitted during a transition for which 
AM = 0 is polarized along the z axis (the direction of the magnetic field). 
Similarly, for transitions for which AM = =bl, the emitted light is 
circularly polarized in the xy plane. In passing, we might mention that 
investigations of the Zeeman effect provide one of the most powerful tools 
for the determination of the characteristics of the states involved in 
atomic spectra. 

9k. The Radial Portion of the Atomic Orbitals. Up to the present 
we have not specified the exact nature of our atomic orbitals, aside from 
the specification that the angular portion of the orbitals would be the 
ordinary spherical harmonics, since they were assumed to arise from the 
solution of a central field problem. For any quantitative calculation of 
energy levels some form must be chosen for the radial portion of the 
orbital. The best one-electron orbitals are found by the method of 
Hartree, which is discussed in the next section. In approximate work 
it is often desirable to use orbitals which, although less accurate than 
those obtained by Hartree’s method, are simpler in form and hence 
easier to use. For example, Zener 10 and Slater 11 have used orbitals of 
the form 

Nr (n *~ 1) e~ (Z ~' ) "^<‘Y (l, m\d, v ) 9-92 


where n* and s are adjustable constants and N is a normalizing factor. 
These eigenfunctions are solutions of the central field problem where 
V(r) is given by the relation 


V(r) 


(. Z - s)e 2 n*(n* - 1 )h 2 
r + 8r 2 mr 2 


9-93 


For large values of r this approaches 


F(r) 


( Z - *)e 2 
r 


9-94 


corresponding to a screening of the nucleus equivalent to a atomic units; 

10 C. Zener, Phyt, Rev., 86, 51 (1930). 

11 J. C. Slater, Phyt. Rev., 86, 57 (1930). 
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in other words, the effective nuclear charge is equal to Z s. From its 
resemblance to the quantum number n of the hydrogen atom, n* is 
known as the “ effective quantum number.” Qualitatively, the eigen- 
functions 9*92 differ from the hydrogen eigenfunctions in that there are 
no nodes in the radial portion, whereas the hydrogen eigenfunctions 
have n — l — 1 nodes. 

By varying n* and s so as to minimize the energy, Slater has been 
able to give the following rules for the determination of these constants: 

1. n* is assigned according to the following table, according to the 
value of the real quantum number n: 

n = 1 2 3 4 5 6 
n* = 1 2 3 3.7 4.0 4.2 

2. For determining s, the electrons are divided into the following 
groups: Is; 2s, 2p; 3s, 3p; 3d; 4s, 4p; 4:d,4f; ••• 

3. The shielding constant s is formed, for any group of electrons, from 
the following contributions: 

(а) Nothing from any shell outside the one considered. 

(б) An amount 0.35 from each other electron in the group considered 
(except in the Is group, where 0.30 is used). 

(c) If the shell considered is an s or p shell, an amount 0.85 from each 
electron with principal quantum number less by 1, and an amount 1.00 
from each electron still farther in; but if the shell is a d or / shell, an 
amount LOO from each electron inside it. 

For example, carbon has two Is electrons, two 2s electrons, and two 2p 
electrons. The approximate radial orbital (unnormalized) for a Is 
electron is, according to the above rules: 

s -5.70L 

^(ls)^e a ° 

while for a 2s or 2 p electron the radial function is 

3.25 r 

(p( 2s) = <p(2p) ^re 2o ° 

Since the constants in these functions were determined by the use of 
experimental data, the functions will be satisfactory for rough quantita- 
tive calculations. Various other sets of screening constants have been 
proposed for use in wave functions of the above type, for example, those 
of Pauling and Sherman. 12 

91. The Hartree Method. According to our discussion of the helium 
atom in Chapter VII, the best wave functions for an atomic system 

11 L. Pauling and J. Sherman, Z. Kristy 81, 1 (1932). 
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should include the distance between the electrons r t; *, explicitly. 
Because of the complexity that would result from the use of such wave 
functions for many-electron atoms, we are limited in practice to the use 
of the one-electron wave functions of the type 9*19. The best possible 
wave functions of this type would be obtained by means of the variation 
method. In the earlier work of Hartree, the simpler product form of 
one-electron functions was used instead of the determinantal form. If 
we write the wave function for an n-electron atom as 


4 / = <pi(ai 1 1 («2 1 2) • * •^n(a»]ft) 9-95 


then the best wave function of this type is that which makes the energy 
E = J* 4/* Hyp dr a minimum, in other words, that which satisfies the 


equation 



9*96 


The above method of expressing the energy is valid only if ^ is normal- 
ized. We can insure that this condition is fulfilled if we require that 
each <p be normalized, that is 


/ 


(Pi ^fi% dr 1 , 1/ I, 2, 


9*97 


The Hamiltonian operator for the system of n electrons is 


H = £ 


h 2 


-87 r 2 m 


V? 


U 


+ IEE- 


i j&iTij 


n J 2 

= EH,- + |ZZ- 

»“I i jyil'ij 


9-98 


The solution of equation 9-96, subject to the condition 9*97, shows that 
the best possible wave function of the type 9-95 is obtained by using the 
set of <p ’ s which are the solutions of the n simultaneous equations 

H ifpi + | £ e 2 dr\ pi = tffi 9*99 

l v T{j j 

Now e 2 |<f>/| 2 is just the charge distribution of the jth electron, and 

„L,I 2 

e z — - is the potential energy of the ith electron in the field of the jth 

r ij 

electron. The term in brackets thus represents the potential energy of 
the ith electron in the field of all the remaining electrons. The method 
of solution of the set of simultaneous equations is the following. First 
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an arbitrary set of p/s is chosen (the choice, of course, being guided by 
any previous knowledge of approximate wave functions for the atom). 
The field arising from all the electrons except the zth electron is calcu- 
lated from this set of <p’8, and w is then calculated from equation 9*99. 
This procedure is carried out for each of the n electrons. The calculated 
set of <p’s will not in general be identical with the original set; the calcu- 
lations are repeated with a new set of <p s which have been altered in a 
manner suggested by the results of the first calculation. This process is 
continued until the assumed set of ps and the calculated set are identical, 
at which point the solution of the set of simultaneous equations has been 
achieved. Since an assumed charge distribution gives a set of ^’s 
which will reproduce this charge distribution, the above method is fre- 
quently called the “ method of the self-consistent field.” 

The energy associated with the wave function 9*95 is 


E 


= J* dr 

= Z vfB&t dn + §£ E e 2 f dr . dr . 9. 

»-»! » jj*i *J r {j 


Comparing this equation with 9*65, we see that it contains terms repre- 
senting the energy of the one-electron wave functions and terms repre- 
senting the “ coulombic ” interaction of the electron cloud distributions, 
but that the term representing the “ exchange ” energy is missing. If 
the ^’s are the solutions of equations 9*99 the energy will be given by 


E 


= £*i-hZZe 2 f 9 

t = l i j*i v o Tij 


101 


The reason why there are no energy terms in 9T00 corresponding to the 
exchange energy is that the original wave function ^ was written as a 
simple product rather than as a determinant, as it should be if the exclu- 
sion principle is to be satisfied. By writing \p as a determinant, and pro- 
ceeding as above, with the additional requirement that the «s’s be orthog- 
onal, one is led to Fock’s equations, 13 which are s imil ar in form to the 
Hartree equation 9-99, but contain additional terms corresponding to the 
potential energies arising from the electron interchange. The results 
for the energy levels and wave functions do not differ appreciably from 
those obtained by the Hartree method. Since our main interest is in 
the problem of molecular structure rather than in that of atomic struc- 
ture, we shall postpone a more thorough discussion of exchange energies 
to later chapters, where they will be of more significance. 

18 V. Foek, Z. Physik, 61, 126 (1930). 



166 ATOMIC STRUCTURE 

It should be mentioned at this point that, although the above wave 
functions, particularly those obtained by the solution of Fock’s equa- 
tions, are the “ best possible ” one-electron wave functions, they give 
values for the energies of atoms which are incorrect by approximately 
0,5 volt per electron. For this reason any results obtained from calcu- 
lations based upon one-electron wave functions can be only quali- 
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Pig. 9-2. Electron distribution in He, Na , and Rb + by the Hartree method. 

tatively correct; it is somewhat unfortunate that these are the only 
wave functions which can be used in most problems concerning the 
energy levels of complex systems. 

In Figure 9-2 we show the electron density distribution, as calculated 
by the Hartree method, 14 for He, Na + , and Rb + . It will be noted that 

u D. if. Hartree, Proc. Cambridge Phil. Soe., 24, 89, 111 (1928). 
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the various shells of electrons can be clearly distinguished, and that 
the “ radii ” of these shells decrease as the nuclear charge Z increases. 


The diameter of the first shell, the K shell, is essentially ~ , where Oq is 


the radius of the first Bohr orbit. In Figure 9-2 the electron density 
D (r) is normalized in such a way that the area under each curve is equal 
to the number of electrons in the atom. 

9m. The Periodic System of the Elements. We will conclude this 
chapter on atomic structure with a brief discussion of the periodic system 
of the elements from the viewpoint of the electronic states of the various 
elements. In Figure 9*3 we have reproduced the periodic chart, the 
arrangement being essentially that suggested by White 15 on the basis 
of the spectroscopic characteristics of the various elements. In this 
table are included the electron configurations of the ground states of 
the elements, the designation of the lowest level, and the ionization 
potentials 10 both numerically and graphically. As a matter of con- 
venience, the members of the rare-earth group between La and Lu have 
been omitted . In giving the electron configurations, only those electrons 
which have been added to the electrons of the preceding rare-gas con- 
figuration have been explicitly noted. For example, the electron con^ 
figuration of Ne is (ls) 2 (2s) 2 (2p) 0 ; the designation (3s) for the elec- 
tron configuration of Na implies (l$) 2 (2s) 2 (2p) 6 (3s). 

As mentioned in section 9c, the electron configuration of the ground 
state of any atom is obtained by adding the electrons one at a time to the 
lowest possible orbitals, taking account, however, of the exclusion 
principle. As previously mentioned, the orbitals lie approximately in 
the order Is, 2s, 2 p, 3s, 3 p, 4s, 3d, 4 p, 5s, 4d, 5 p, 6s, 4/, 5 d, 6 p, 7s, 6d. 
In the hydrogen atom the energy depends only on n, so that, for example, 
the 3s, 3 p, and 3d orbitals all have the same energy. The new order 
of the orbitals in the heavier atoms may be qualitatively understood in 
the following way. The outer electrons move in an effective field which 
is the resultant of the nuclear field and the field of the inner electrons, or, 
as we have previously said, the inner electrons “ screen ” the outer elec- 
trons from the nucleus. The energy binding an outer electron to the 
atom will thus depend on how effectively this electron is screened from 
the nucleus; for example, if a 3d electron is more completely screened 
than a 3 p } and the 3 p is more completely screened than a 3s, then the 
order of the orbitals will be 3s, 3 p, 3d, with the 3s orbital the most stable. 
This is exactly the case in heavy atoms. If we look at the hydrogenlike 

u j£ jj White, Introduction to Atomic Spectra , 85, McGraw-Hill Book Com- 

pany, 1934. 

16 G. Herzberg, Atomic Spectra and Atomic 287, Prentice-Hail, 1937. 
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Fig. 9-3. The periodic system of the elements. 
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eigenfunctions in Table 6*1, we see that, disregarding the constant 
factor, the angular factor, and the common exponential factor, for small 
values of r the 3s eigenfunction is a constant, the 3p eigenfunction varies 
as r, and the 3d eigenfunction varies as r 2 . This means that the proba- 
bility of finding the electron in the immediate neighborhood of the 
nucleus is greatest for the 3s electron and least for the 3d electron, so 
that the 3s electron is screened least by the inner electrons and the 3d 
is screened most; the order of stability is thus 3s, 3 p, 3d. Because of 
this difference in energy between the 3 p and 3d, it is possible for the 4s 
orbital to be even more stable than the 3d, and we observe that this is so. 
Similar arguments hold for the relative stability of / orbitals; it is 
observed that the 4 / orbitals are so completely screened by the inner 
electrons that the 6s orbital is more stable. 

By adding electrons into the various possible orbitals, taking proper 
account of the exclusion principle, we get the electron configurations 
as shown in Figure 9*3. The Is shell is complete at He, the 2s shell at 
Be, and the 2 p shell at Ne. From the viewpoint of electron configura- 
tion alone, He should properly be in the same column as Be; from the 
viewpoint of chemical properties, it belongs with Ne. This point will 
be discussed later. The 3s shell is completed at Mg, the 3 p shell at A, 
and the 4s shell at Ca. At this point the 3d shell begins to fill up. Since 
the 4s and 3d orbitals are close together, it happens that the 3d shell does 
not fill up smoothly; there is a tendency for the 3d shell to fill up at the 
expense of the 4s shell. At Cu the 3d shell is complete, and there is one 
4s electron present; at Zn both the 3d and the 4s shells are filled. The 
4p electrons are now added, this shell being completed at Kr. The 
same procedure is repeated with the 5s, 4d, and 5 p electrons. The 6s 
shell is completed at Ba, then one 5d electron is added, followed by the 
fourteen 4 f electrons. The 4 f shell is completed at Lu, which, as far as 
electrons outside closed shells are concerned, has the same configuration 
as La. The addition of the remaining 5 d electrons and the addition of 
the 6 p electrons brings us finally to Rn. Little is known about the 
remaining elements. 

We now note that all the elements in a given column have the same 
ground state. For example, the elements H, Li, Na, K, Rb, and Cs 
have 2 S h as the ground state; the elements Zn, Cd, and Hg have 1 S 0 
as the ground state; the elements C, Si, Ge, Sn, and Pb have 3 Po as the 
ground state. Since the chemical properties of an element are deter- 
mined by the electron configuration of the element, we see why the ele- 
ments in any given column have similar chemical properties. The 
above statements are not strictly true for the elements in those portions 
of the table where the d shells are being filled. Owing to the competition 
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between the d and s shells, certain irregularities arise; for example, 
while Ti, Zr, and Hf have the same electron configuration, that of Cb 
differs from that of V and Ta. In the first four and last eight columns 
the correspondence is exact; the intermediate columns show more or less 
variation. 

We also note that the chemical characteristics of an element are not 
completely determined by the electron configuration of the ground state, 
since, for Example, K and Cu both have 2 S ^ as the lowest level, and 
Ca, Zn, and Kr all have 1 >S r o* In order to understand the differences 
between such elements several other factors must be considered. The 
differences between K and Cu are undoubtedly to be attributed to the 
fact that in Cu we have a 4s electron on top of the rather loosely held 
3d shell, whereas in K we have a 4s electron on top of the tightly held A 
configuration. The result of this is that the 4s electron in Cu moves in a 
stronger effective field than the 4s electron of K, hence the ionization 
potential of Cu is considerably higher than that of K. The 4s electron 
in K is therefore more readily removed than that of Cu, so that, though 
there are many similarities between the two elements, K is the more 
reactive. 

As will be seen in the following chapters, the normal covalence of an 
atom is equal to the number of unpaired electron spins in the atom. 
F, 0, and N have multiplicities of 2, 3, 4, respectively, so that the spin 
quantum number S has the values J, respectively, for these atoms. 
These values of S can arise from 1, 2, 3 unpaired spins, so that F, O, 
and N have normal covalences of 1, 2, and 3, respectively. Be, on the 
other hand, should have a normal covalence of zero. However, com- 
paring the ionization potentials of Be and B, we see that the 2 p orbital 
is only slightly less stable than the 2s. One of the 2s electrons in Be 
can thus be rather readily moved up to the 2 p level, giving two unpaired 
spins and hence a covalence of two. Similarly, by moving one electron 
from the 2s to the 2 p level in B and C, we obtain the normal covalences 
of 3 and 4 for these elements. The behavior of Zn is analogous to that 
of Be. For the rare-gas elements, a quite different condition exists. 
For example, the ionization potential of Ne is 21.45 e.v., while that of 
Na is only 5.12 e.v. There is thus a great gap between the 2 p and 3s 
levels. The promotion of a 2p electron in Ne to the 3s level, which 
would give Ne a covalence of 2, requires so much energy that this con- 
dition does not occur. Similarly, there is a large gap between the Is 
and the 2s levels; for this reason He has the chemical properties of Ne 
rather than of Be. 

As we fill in a given shell, say the 2 p shell, the ionization potential 
increases with increasing atomic number; since the electrons in a given 
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shell screen each other very little, the effective nuclear charge is increas- 
ing. An exception will be noted in the oxygen group; the ionization 
potential of 0 is less than that of N. As long as there are three or less 
electrons in a p shell they can all be in different ones of the three available 
P orbitals (p x , p y , p*), so that the electrostatic repulsions are a minimum. 
The fourth p electron must go into an orbital which is already occupied 
by one electron; the increased electrostatic repulsion causes the electron 
to be bound less firmly than would be expected from considerations of 
effective nuclear charge alone. 



CHAPTER X 


GROUP THEORY 

The Schrodinger equation can be solved exactly in only a very few 
simple cases; in general, we are limited to the approximate methods 
of solution discussed in Chapter VII. There exists, however, a large 
class of results which depend only on the symmetry properties of the 
system under consideration; these results can be obtained exactly by 
use of the branch of mathematics known as group theory. We pre- 
sent in this chapter an elementary treatment of group theory, although 
we make no claim to completeness. Several of the important theorems 
are presented without proof; the interested reader may refer to one of 
the complete expositions of the field (see General References). Group 
theory, in the form which will be of interest to us, makes considerable 
use of matrix notation, so that we shall first present the elements of 
matrix algebra, with particular reference to the matrices involved in 
linear transformations of coordinates. 

Y 



10a. Matrices. Let us consider a point in the xy plane, this point 
being specified by the coordinates (x\ t 2/1) (Figure 10 * 1 ). These two 
numbers may also be thought of as defining the vector ri. A rotation 
of this vector through an angle 0 will transform it into the new vector 
r 2 , defined by the numbers x 2 and y 2 , The relation between (x 2) 2/2) 
and fa, 2/1) and the angle of rotation 6 is given by the set of linear 
equations 

x 2 = X\ cos 6 — 2/1 sin 6 


y 2 = xi sin $ + y\ cos $ 
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The reverse transformation is 


cos 0 , sin 9 
x t = x 2 —— + U 2 —r~ 


sin 0 , cos 0 
2/i = -x 2 A -+y 2 — 

where A is the determinant of the coefficients in 10* 1 : 


cos 0 — sin 0 
sin 0 cos 0 


= cos 2 0 + sin 2 0 = 1 


The set of equations 10* 1 can be written as 

/ /cos 0 — sin 0\ / 


where 


(;)•(: 


cos 0/ \2/i 


f cos 0 — sin i 

isin0 cosi 


is the matrix of the transformation which takes r x into r 2 . The corre- 
sponding matrix for the reverse transformation is 


cos 0 sin i 


sin 0 cos I 


In the n-dimensional case, we have the corresponding set of equations 
x[ = an*! + a 12 x 2 4 1- a ln x„ 

X 2 = 021*1 + 022*2 + • • • + a 2 »*n 106 


x n = Gnl^l + 0*2^2 + * • • + &nn x n 

where the x+’s are the new coordinates and where the a 1 s satisfy the 
relations 


» 


Z4 = i 

y-i 

ft 

fc-1,2,- 

* * n 


II 

I'M 

*■* a 

ii 

fc - 1, 2, • 

* • n 

10-7 

2>y*o,-, = 0 
i-1 

n 

k,l — 1,2, • 

* • n; k t* l 


Za W Oy = 0 
*»1 

jfc, f = 1, 2, • • 

• n; k I 
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la addition, the determinant of the a’s is unity. The matrix formed 
by a set of a's which satisfy the relations 10*7 is said to be a unitary 
matrix; the matrices representing rotations, reflections, and inversions 
are unitary. It will be noted that the matrices 10*3 and 10*4 
satisfy the relations 10*7. The transformation which is the reverse 
of 10*6 is given by the set of equations 

xi = a n x[ + 0 , 21 X 2 + • • • + a n iXn 

x 2 = di 2 x[ + a 2 2 x 2 + * * * + a n2 x' n 10*8 


Xn = d Xn x[ + a 2n X 2 H + a nn Xn 

The matrix of the reverse transformation is thus obtained from the 
matrix for the original transformation merely by changing columns 
into rows. 

The set of equations in 10-6 can be written in the compact form 

x f j = Y,djkXk k, j = 1 , 2 , • • * n 10*9 

An even simpler notation which expresses the same thing is obtained 
if we write 

x = ax 10*10 

where a is the matrix 

fan d\2 • • • dln\ 
d2l d 22 * * * 02n 

H 

\dnl d n2 * * • a nn J 

of the transformation from the unprimed to the primed coordinates. 
A second transformation could be written as 

x" = bx r ; or x $ *' = Yfiijx'j 10*11 
; 

These two successive transformations are equivalent to some one trans- 
formation 

x " == cx; or x" = T,c ik x k 10*12 

k 

Combining the above equations, we have 

Xi ** Y,bijXj = ” liL,CikXk 10*13 

j i k k 

The components of the product matrix c = ba are thus given by the 
relation 


Ci k = T,bi ijajk 


10*14 
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which is the rule for matrix multiplication. The product of the ma- 
trices of the transformations 10-6 and 10*8 must be a matrix which 
represents no transformation of coordinates at all, that is 


/ ®11 

ai2 • 

' * &ln\ 

( ®11 

a 2 i * 



A 

0 • 

• o\ 

«21 

a 22 * 

’ * «2n 

a 12 

a 22 * 

’ * &n2 

= 

0 

1 • • 

• 0 

V*»i 

a n2 • 

* * ^ U n n/ 

Vln 

0>2n * 

* * Clnn) 


\o 

0 • ■ 

• V 


From the rule for matrix multiplication, and the relations in 10*7, we 
see that the matrix equation 10*15 is indeed true. 

Before proceeding with the formulation of group theory, we need 
certain general concepts concerning vectors. In three-dimensional 
space, any three numbers may be thought of as defining a vector, the 
vector from the origin of coordinates to the point specified by the three 
numbers. If we have the two vecors A and B, defined by the num- 
bers (Ai t A 2 , A3) and (B 1, B 2 , £3), the vectors are said to be orthog- 
onal if 


AyB\ + A 2 B 2 + A3B3 = 0 

In the three-dimensional case, this means that vectors are perpen- 
dicular, or, in the notation outlined in Appendix II, the scalar product 
A • B is zero. In more general terms, we may consider n numbers 
(Ai, A 2 * • • A n ) as defining a vector A n in n-dimensional space. If 
B„ is another n-dimensional vector, the two vectors are said to be 
orthogonal if their scalar product 

A„ • B u = AiJ?i + A 2 B 2 + • • • + A n B n 

is equal to zero. If the numbers which define the vectors are com- 
plex, the vectors are said to be orthogonal if the Hermitian scalar 
product 

(An • B n ) = A*Bi + AtB 2 + * * * + A*B n 


is zero. 

In three dimensions any arbitrary vector can be expressed in terms 
of a linear combination of three orthogonal vectors, for example, the 
three unit vectors along the coordinate axes. In other words, it is 
possible to construct only three independent orthogonal vectors in 
three-dimensional space. Analogously, in n-dimensional space, it is 
possible to construct only n independent vectors. This concept of a 
set of numbers defining a vector will prove useful later. 

10b. The General Principles of Group Theory. The set of opera- 
tions which send a symmetrical figure into itself are said to form a 
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group. Let us consider the symmetrical figure formed by three points 
at the comers of an equilateral triangle, as in Figure 10*2. The opera- 
tions which send this figure into itself are: 

1. The identity operation E , which leaves each point unchanged. 

2. Operation A f which is a reflection in the yz plane. 

3. B — reflection in the plane passing through the point b and per- 
pendicular 'to the line joining a and c. 

4. C — reflection in the plane passing through c and perpendicular 
to the line joining a and b . 

5. D — clockwise rotation through 120°. 

6. F — counterclockwise rotation through 120°. 



Other symmetry operations are possible, but they are all equivalent 
to one of the operations given above. For example, a clockwise rota- 
tion through 240° is a symmetry operation, but it is identical with 
operation F; a rotation through 180° about the y axis is identical with 
operation A. 

The successive application of any two of the operations listed above 
will be equivalent to some single operation. Rotation in the clock- 
wise direction through 240° is obtained by applying operation D twice; 
this is equivalent to the single operation F — we denote this fact by 
the equation DD = F. Operation A interchanges points b and c; if 
operation D is applied to the resulting figure, c is returned to its original 
position, b goes to the position originally occupied by a, and a goes to 
that originally occupied by b . Operation A followed by operation D 
is thus equivalent to operation C, or DA = C . If we work out all 
jkMBible products of two operations, we obtain the following multi* 
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plication table, Table 10-1, where the operation which is to be applied 
to the figure first is written across the top of the table. The set of 
operations E, A, B, C, D, F forms a group, and Table 10*1 is known 
as the multiplication table for this group. The number of operations 
in the group is called the order h of the group; here the order of the 
group is 6. 

TABLE 10-1 

E A B C D F 

E 
A 
B 
C 
D 
F 

More generally, any set of elements P, Q, R, S * • • is said to form a 
group if the following conditions are satisfied: 

1. The product of any two elements in the set is another element in 
the set. 

2. The set must contain the identity operation E which satisfies the 
relation ER = RE = 2?, where R is any element of the set. 

3. The associative law of multiplication, P(QR) = ( PQ)R , must 
hold; that is, P times the product of Q and R must be equal to the 
product of P and Q times R. 

4. Every element must have a reciprocal such that, if 2? is the re- 
ciprocal of S, then RS = SR = E . 

That all these conditions are satisfied by the group given above is 
easily verified. The commutative law of multiplication does not neces- 
sarily hold. From Table 10*1 we see that AB = D ; BA = F, so that 
AB BA . If PQ = QP for all elements of the group, the group is 
said to be Abelian. 

In the group of symmetry operations on three points as given above, 
we have three distinct types of operations: the identity operation E; 
the reflections A, B , and C ; and the rotations D and F . We say that 
each of these sets of elements forms a class; that is, E forms a class by 
itself, At B t and C form a class, and D and F form a class. Usually 
the geometric considerations will enable us to pick out the classes; 
more precisely, two elements P and Q which satisfy the relation 
X~ l PX * P or Q, where X is any element of the group and X" 1 is 
its reciprocal, are said to belong to the same class. From the multi- 
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plication table, we have 


EDE 

= D 

EFE = F 

ADA 

= F 

AFA = D 

BDB 

= F 

BFB = D 

CDC 

= F 

CFC = D 

FDD 

= D 

FFD = F 

DDF 

= D 

DFF = F 


The elements D and F therefore form a class; in the same way it is 
found that A, B, and C form a class, and that E forms a class. If the 
group is Abelian, then X~ l PX = X~ l XP = P for all X’s and P’s. Each 
element of the group then forms a class by itself, and the number of 
classes is equal to the number of elements. The concept of a class of 
operations has the following geometric meaning. If two operations 
belong to the same class, it is possible to pick out a new coordinate 
system in which one operation is replaced by the other. For example, 
in the group given above, we could equally well have taken our y axis 
through the point b and perpendicular to the line joining a and c. The 
operation A in the new coordinate system is the same as the operation 
B in the old coordinate system, since A has been defined to be a re- 
flection in the yz plane. 

Any set of elements which multiply according to the group multi- 
plication table is said to form a representation T of the group. For 
the group given above, we immediately see that the sets of numbers 
assigned to the various elements in the following way form representa- 
tions of the group: 

E A B C D F 

111111 

1 -1 -1 -1 1 1 

The corresponding matrices will also form a representation of the group 
if we replace ordinary multiplication by matrix multiplication. If we 
denote by e, a, b, c, d, f the matrices of the transformations of coordi- 
nates associated with the corresponding operations, we see that these 
matrices form a representation of the group. That is, the product of, 
say, A and B is AB = D ; the product of the matrices a and b must 
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therefore be ab = d, so that the matrices multiply according to the 
group multiplication table. We have therefore found three matrix 
representations: 



E 

A 


B 


C 

D 


F 


ri 

(1) 

(1) 


(1) 


(1) 

(1) 


(I) 


r* 

(1) 

(-1) 


(-1) 


(-D 

(1) 


(1) 






1 


1 ^3\ 

/ „1 


/I _ 


1 

1 °\i 

f-1 0 \ 


2 

2 

2 2 

2 

2 

2 

2 

r 8 











\ 

1/ 

0 +1/ 


v^3 

1 

v'i 1 

^3 

1 


1 




V 

2 

-2/ 

\ 2 2/ 

\ 2 

2/ 

\ 2 

2/ 


In r 3 , the matrices e and a can be written down immediately, d and 
f are obtained from 104 by inserting the proper value of 6; b and c 
can then be found by means of the group multiplication table and the 
rule for matrix multiplication. 

It is possible to find other representations of the group. For ex- 
ample, if we assign to the points, a, b, and c the coordinates (x a , y a ), 
etc., the matrices of the transformations would be of dimension 6 
(six-row matrices) and would form a representation of the group. 
Let us suppose that we have found some such representation, and let 
us call the corresponding matrices e\ a', b r , c', d', i'. The new set 
of matrices & ,r t — p 1 e / p; &" — p _1 a , p / ; W = p — 1 b / p, etc., also form 
a representation of the group, as may be seen as follows. Assume that 

a"b" = d" 1016 

Then p — 1 a , pp~ 1 b , p = p -1 d'p. From the associative law of multi- 
plication, we have 


p-Vb'p = p-M'p 


If we now multiply from the left by P and from the right by p 1 we have 

aV = d' 1017 

Since 10-17 is true, 10.16 must also be true; the transformed matrices 
e", etc., therefore also form a representation of the group. The trans- 
formations of the type a" = p -1 a'p are called similarity transfor- 
mations. Let us now suppose that it is possible to find a similarity 
transformation which will transform all the matrices e', sl • • • into 
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V: 

the form 


a" = p~Vp = 


&1 

0 

0 

0 

a 2 

0 

0 

0 

a 3 


\ 


10-18 


where &[' is a square matrix which has the same dimension as 
b[', c[' • • -, and where there are only zeros outside the squares. 
Since a /, b ,/ = d" ■ • • , we have from the law of matrix multiplication 
the relations 


a('b(' = d” 

a 2 'b 2 ' = d' 2 ' 10-19 


The sets of matrices e{', b[ f • ■ e 2 ', a 2 ', b 2 ' • • • ; etc., therefore 

form representations of the group. The matrix representation e', 
a', b' ■ • • is said to be reducible and to have been reduced by the simi- 
larity transformation with the matrix p. If it is not possible to find a 
similarity transformation which will further reduce all the matrices 
of a given representation, the representation is said to be irreducible. 

The representations Ti, r 2 , r 3 given above are all irreducible. Since 
matrices representing transformations of interest to us are unitary, 
we may restrict ourselves to representations which involve only unitary 
matrices and to similarity transformations with unitary matrices. 
Two irreducible representations which differ only by a similarity trans- 
formation are said to be equivalent. We shall now show that the 
non-equivalent irreducible representations r t , r 2 , r 3 given above are 
the only non-equivalent irreducible representations of the correspond- 
ing group, and we shall then state, without proof, certain general 
theorems regarding irreducible representations. 

We denote by r,(f2) the matrix corresponding to the operation R 
of the ith irreducible representation, and by r t (R) mn the mnth com- 
ponent of this matrix. For the above representations, we therefore 
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have the relations : l 


ZTiiRhiTxiBht - 6 

ft 

Er 2 («)iir 2 (B)n - e 

R 

Er3(jR)nr 8 (jB) n = Er 3 (2J)i2r 3 (ft)i 2 

R R 

= Er 3 (i2)2ir 3 (iB)2i = Er 3 (/2)22r 3 (/J)22 = 3 

ft 


10*20 


10*21 


In addition, we note that 

zri(fl)nr 2 (fi) n =0 

ft 

Eri(B)ur 3 (B) u =o 

ft 

EncfljnrsfBJu = o 

ft 

Er 8 (B)iir 8 (B)i 2 = 0 , etc. 

ft 

All the relations of the type 10*21 can be expressed by the equations 

= 0; tV j 

R 10*22 
Y,ri(R)mnTi(R)m'n' =0; m 5 ^ w! \ n ^ n f 

ft 


while the relations 10*20 can be written as 
'LT i (R) mn r i (R) mn 

ft 


h 

k 


10-23 


where h is the order of the group and Z,- is the dimension of tiiQ ith 
representation. Equations 10-22 and 10-23 can be combined into the 
general relation (see Appendix VI) 

Lr,(fl) m „ yfj Tj(R) m 'n> yf-i = «<,• W S nn , 10-24 

where 6 f *y =» 1 if i — j; 5,y = 0 otherwise. Equation 10-24 may be shown 
to be true for the non-equivalent irreducible representations of any group . 

From equation 10*22 we see that the matrix components I\*(jRi) w », 
Ti(R 2 )mn * * * I \(Rh)mn of the h elements of the group can be regarded 
as the components of an /t-dimensional vector which is orthogonal to 
any one of the vectors obtained by a different choice of the subscripts 

1 To be more general, these expressions should be replaced by EtFi(i?)]uri(R)ii f 

ft 

etc., but this form of the equations will be sufficiently general for our purposes. 
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m and w, as well as being orthogonal to any of the similar vectors ob- 
tained from a different irreducible representation. If there are c such 
irreducible representations, each of dimension U, there are l\ + + • • * 

+ l 2 such orthogonal vectors. But it is possible to construct only 
h orthogonal ^-dimensional vectors. Actually, 

^1 *4" d” • * • *4~ if == h 10*25 

This result is perfectly general and follows directly from 10*24. The 
representations Tj, r 2 , and r 3 are therefore the only non-equivalent 
irreducible representations of the symmetric group of three points. 

The sum of the diagonal elements of a matrix is known as the char- 
acter of the matrix. We denote by Xi(R) the character of the matrix 
of the operation R belonging to the ith irreducible representation of 
the group, that is 

Xi(R) = zr i{R)mm 10*26 

m 

The characters of the representations Fi, r 2 , r 3 of the group which we 
have been discussing are 

E A B C D F 

xi 1 1 1 1 1 1 

X2 1 -1 -1 -1 1 1 

x 3 2 0 0 0 -1 -1 

The character of a matrix is unchanged by a similarity transformation. 

The character of a matrix P is xr — Y.Fjj- The character of 

i 

Q = X~ x PX is 

XQ = ZQii = EEI :x?p jk x ti = ZEZX ki X^P jk 

i i j k j k x 

= EL hjPjk = HPjj = xp 

i k j 

If two operations belong to the same class, the corresponding matrices 
for a given representation have the same character, as may also be 
verified from the character table above. 

Prom 10*24, we have the result 

i{R) mm T j (R)m , m' ** Y ^*7 

R 

Summing over m from 1 to U and over m! from 1 to lj gives 

Mw(«) = r 5 w E E W - E 1 - hSii 10-27 

, M lj *»"1 m'» 1 lj m'«l ,nJ 
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We see, therefore, that the characters of the matrices of the irreducible 
representations form sets of orthogonal vectors. Since the character 
is unchanged by a similarity transformation, we see that two non- 
equivalent representations have different character systems and that 
two irreducible representations with the same character system are 
equivalent. 

Since the characters of all matrices of a given representation which 
correspond to operations in the same class are equal, 10*27 can be 
written as 

Z Xi(Rp)xj(Rp)g B = hSij 

P-1 

or 

Z xi(R,) yjj xM = in 10-28 


where g p is the number of elements in class p, R p is any one of the 
operations in this class, x(R P ) is the corresponding character, and k is 


the number of classes. The normalized characters Xi(R P ) 


& 


are 


therefore the components of a set of orthogonal vectors in ^-dimensional 
space. Since there can be k such vectors, we see that the number of 
irreducible representations is equal to the number of classes. 

From the relations already developed it is possible to obtain further 
interesting results. Any matrix representation of a group must be 
some one of the irreducible representations or some combination of 
them; otherwise it would be an additional irreducible representation, 
but the number of irreducible representations is limited to thg number 
of classes. Any reducible representation can be reduced to* its irre- 
ducible representations by a similarity transformation whh^baves the 
character unchanged. Thus we can write for the character matrix 
R of the reducible representation the expression 



x(R) 


£ ajXj(R) 

3 - 1 


10*29 


where ay is the number of times the jth irreducible representation 
occurs in the reducible representation. From 10*27 we have 


Ex(R)xi(R) = EDw(*)x<(*) - ** 

« ft i 

so that the numbqr of times the irreducible representation 


10-30 
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in the reducible representation is 

a i =JZx(% i (B) 10-31 

h r 

Since there is a one-to-one correspondence between the character sys- 
tems of a group and the irreducible representations of the group, we 
will usually find it sufficient to deal with the characters themselves 
rather than with the irreducible representations. For any group, the 
character table can be built up by means of the relations already de- 
rived. We here summarize these rules in a convenient form. 

Rule 1. The number of irreducible representations is equal to 
the number of classes of the group. 

Rule 2. The sum of the squares of the dimensions of the irre- 
ducible representations of a group is equal to the order of the group, 
that is, 

li + ^2 + * * * + tfc = h 

Since lj = this is equivalent to the relation 

Elxj(E )] 2 = h 10-32 

J 

Rule 3. The character systems of non-equivalent irreducible 
representations form orthogonal vectors; that is 

Hxi(R)Xj(R) = 0 ; i 9 * j 10-33 

R 

Rule 4. The sum of the squares of the characters of a given ir- 
reducible representation is equal to the order of the group; that is 

* Uxi(R )] 2 - h 10-34 

ijV R 

10c. Group Theory and Quantum Mechanics. We consider now 
the Schrfidinger equation 

' ,ir H 

for some atomic or molecular system. Suppose that R is some trans- 
formation of coordinates which has the effect of interchanging like 
particles in the system. For example, in helium, R could be the trans- 
formation which interchanges the two electrons; in H 2 0, R could be 
the transformation which interchanges the hydrogen atoms. We sub- 
ject both sides of the Schrodinger equation to the transformation R f 
obtaining ERfa = RErfi. Since R interchanges only like particles, 
it can have no effect od the Hamiltonian, so that KEL « HR. R, of 
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course, commutes with the constant Ei, so that we have 

JIRti = EiRfa 10-35 

that is, the function Rfc is a solution of the Schrodinger equation with 
the eigenvalue If Ei is a non-degenerate eigenvalue, then fa or 
constant multiples of are the only eigenfunctions satisfying 10*35, 
so that for this case we have Rrpi = in order that Rfc be normalized, 
c = ±1. If Ei is ifc-fold degenerate, then any linear combination of 
the functions V'ti, fe, * * * 'I'ik will be a solution of 10*35, so that in this 
case we have 

k 

Rtn = 10*36 

y-i 

where the arf s must satisfy the relation 

e4 = i 

;-l 

If >S is another operation which interchanges like particles, we also have 

k 

Stii = E b mf hm 10*37 

m»l 

Applying operation S to 10*36 gives 

& k k 

SRfii = X) OjiSfaj = I I ajibmjtim 10-38 

j-1 ; — lm*l 

Now the product of 5 and 1?, which we may denote by SR = T, is 
likewise an operation which interchanges like particles, so that 

k 

, % - E 10*39 

TO** 1 

Comparing 10*38 and 10*39, we see that 

k 

Cml ** E b m j(Zjl 10*40 

i-l 

If we now form the matrix a from the coefficients aji, and the matrix 
b from the coefficients b m j , we see that the product of these two ma- 
trices is equal to the matrix c formed from the coefficient more- 
over, all the matrices are unitary. In other words, the matrices 
obtained from the coefficients in the expansion of R^u, etc., form a 
representation of the group of operations which leave the Hamiltonian 
unchanged. The set of eigenfunctions \pu, • • is said to form a 
basis for a representation of the group, since the representation is 
generated by the application of operations R, S y etc. The dimension 
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of the representation is equal to the degeneracy of the corresponding 
eigenvalue. The representations generated by the eigenfunctions 
corresponding to a single eigenvalue are irreducible representations, as 
otherwise it would be possible to form sets of linear combinations 

Mu tv • * s t'i, 8 +i • * •; • • •; * • • tik 

of the original eigenfunctions such that operations of the group would 
send one of the new eigenfunctions into a linear combination involving 
only members of the same set. But, if this were possible, the eigen- 
values corresponding to the new sets could be different, which would 
contradict our original assumption, except for the extremely rare case 
of “ accidental degeneracy ” (where two eigenvalues are the same 
even though the corresponding eigenfunctions behave differently under 
the operations of the group). We may therefore in general assume 
that sets of eigenfunctions with the same eigenvalue form a basis for 
an irreducible representation of the group of operations which leave 
the Hamiltonian unchanged. Returning to our original notation, if 
Tj is an irreducible representation of dimension k f and if $f, 
is a set of degenerate eigenfunctions which form the basis for the jth 
irreducible representation of the group of symmetry operations, these 
eigenfunctions transform according to the relation 

m = zr io-4i 

/-i 

If we are dealing with a symmetrical atomic or molecular system, these 
considerations place a severe restriction on the possible eigenfunctions 
of the system. All possible eigenfunctions must form bases for some 
irreducible representation of the group of symmetry operations. From 
a knowledge of the irreducible representations of the group, we there- 
fore know immediately what degrees of degeneracy are possible. The 
form of the possible eigenfunctions is also determined to a large extent, 
since they must transform in a quite definite way under the operations 
of the group. For example, if our system had the symmetry of the 
group of three points which we have discussed in detail in this chapter, 
our eigenfunctions would be of the following types. There would be 
a set of eigenfunctions which would form bases for the representation 
3?i. These eigenfunctions would be non-degenerate and would remain 
unchanged if subjected to any of the operations of the group. 
There would be another set of non-degenerate eigenfunctions which 
form bases for the representation r 2 ; these would remain unchanged 
If subjected to operations E, D, and F } but would change sign if sub- 
jected to operations A, B, or C. Finally, there would be a set of doubly 
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degenerate eigenvalues; two eigenfunctions with the same eigenvalue 
would behave in the manner determined by the matrices for the irre- 
ducible representation r 3 and equation 10-41. No other types of 
eigenfunctions would be possible; for example, there would be no 
triply degenerate eigenvalues, nor would there be any non-degenerate 
eigenfunctions which changed sign when subjected to operations D 
or F. 

lOd. The Direct Product. Let us suppose that R is some operation 
of a group, and that Ai, A 2 • • • A m ; B h B 2 - • B n are two sets of 
functions which form bases for representations of the group. Then 

m 

RAi = YL a jiAj 
y-i 

RBk = E hkBi 

2-1 

and 

RA{B k = E E djibikAjBi = EEtyj, ik AjBi 

i j-i i i 

The set of functions A{B k forms a basis for a representation of the group 
of dimension mn. The matrix c of this representation has the character 

x(c) = EEty*, ji = E E«ii&« = x(a)x(&) 10-42 

i i y-i 2—i 

The set of functions A ,-5* is called the direct product of the sets of func- 
tions Ai and Equation 10-42 then tells us that the character of 
the representation of the direct product is equal to the product of the 
characters of the individual representations. The representation of 
the direct product of two irreducible representations will in general 
be a reducible representation but may be expressed in terms of the 
irreducible representations by means of equation 10-31. For example, 
for the direct products of the irreducible representations r 1; r 2 , r 3 of 
the symmetric group of three points, we have ' 

TiTi = r x r x r 2 = r 2 

r 2 r 2 = r x r x r 3 = r 3 

r 3 r 3 = r x + r 2 + r 3 r 2 r 3 = r 3 

The importance of the direct product appears when we wish to evalu- 
ate integrals involving functions which are bases for representations of 

the group. If we have an integral J' <pavb dr, - this integral will be 

different from zero only if the integrand is invariant under all the 
operations of the group or may be expressed as a sum of terms of which 
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Bit least one is invariant. The integrand belongs to the representation 
Tin* = Ta where Fa I'b is the direct product of the representations 
of <pa and <pb- In general, TaT^ will be reducible, that is, will be ex- 
pressible as 

T a T b = L«.Tv 

1 

where the lYs are irreducible representations of the group. The in- 
tegral will be different from zero only if TaTb contains the totally 
symmetrical representation IY It may readily be verified from the 
tables in Appendix VII that, if the characters of the representation 
are real, as they will be in all cases of interest to us, then IYIY con- 
tains Ti only if Ya = IV For our purposes, therefore, we may state 

the following corollary to this theorem : The integral J <paJ<pb dr is 

different from zero only if T a Tb = I>. Moreover, since the Hamil- 
tonian operator belongs to the totally symmetrical representation r lf 

the integral J <paH<pb dr is different from zero only if T A = IV In 

the secular determinant of the type 

Hu — SnE H 12 — S12E • • • Hin — SinE 

H21 — S21E H22 ~ S22E * • • H2n ~ $2 nE 

’ = 0 

Hnl SnlE Hn2 S n 2E * * * H nn S n nE 
the terms and Sy will be different from zero only if and <pj be- 
long to the same irreducible representation. By classifying the eigent 
functions <p according to the representation to which they belong, i- 
is often possible to reduce the order of the secular equation. 

It may happen that in certain problems we start a perturbation 
calculation with zero-order eigenfunctions which do not themselves 
form bases for irreducible representations of a group. If we take the 
proper linear combinations of these eigenfunctions so that the new 
eigenfunctions form bases for irreducible representations, the secular 
equation will be simplified. These linear combinations can be found 
by the following procedure. Denote the original eigenfunctions by 
<p and the new by <p, where <p % km is the eigenfunction belonging to the 
tth irreducible representation of dimension U with the eigenvalue 
Suppose further that there are $»• eigenvalues corresponding to the 
representation F,*; that is, the representation occurs times in the 
reducible representation to which the <p h s belong. Then any of the 
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<p h a may be expressed in terms of the <f>’s by . 

V = E L E c i ikmfkm 1^-4^ 

If 22 is any operation of the group, then by 1041 

V = EE E c tAm E r.CflwL id-44 

t m=l n=l 

lj - . 


If we now multiply by xj(R) = Z and sum over all operatJbns 

1 

of the group, we have / 

XxmRv = E E E c ikm E E Er i («)„r l (f?) nm ^l n io4s 

R i Jt-1 T7>~ 1 n-1 t« 1 K 

From 10*24, we see that this expression reduces to 

Ex/ (B)Rp' = E Zc jkm y E E fitn&im *Pkn 1046 

B t=l m-1 l 3 n-1 /"I 

•i l l h 

= EEr c »A 1047 

Equation 10*47 has the following meaning. If both Sj and lj are unity, 

then will give a constant times <p{ X) regardless of which 

R 

<p we use. If lj is unity, but Sj is, say, two, then Y,Xj(R)R<p' will give 

R 

expressions of the form cuf{\ + 6^21- There will be two linearly inde- 
pendent expressions of this form; the combinations of those which 
correspond to the two eigenvalues are determined in the usual way 
after the corresponding two-row determinant has been solved. For 
Sj equal to unity, lj equal to 2, we obtain two independent linear com- 
binations which have the same eigenvalue. For s/ and lj equal to 2, 
we obtain four independent linear combinations, the solution of the 
corresponding secular determinant will then enable us to form two 
sets of two combinations each, one set for each of the two eigenvalues. 
The other cases are analogous. 

A description of the various symmetry groups of interest in the theory 
of molecular structure has been included in Appendix VII. Appen- 
dix VII also contains the character tables for these groups, as well as 
the transformation properties of certain quantities which will be of 
interest in our later work. We shall make considerable use of group 
theory in later chapters; the actual applications are much easier to 
perform than might be expected from some of the rather complicated 
equations which appear above. 



CHAPTER XI 

ELECTRONIC STATES OF DIATOMIC MOLECULES 

11a. Separation of Electronic and Nuclear Motions. A molecule 
is usually defined as a stable group of atoms held together by valence 
forces. We shall here, however, use the word molecule in a somewhat 
wider sense, to denote any system of atomic nuclei and electrons, 
Whether stable or not. If we regard the problem of the motion of such 
a system from the classical viewpoint, we see that, because of the great 
masses of the nuclei as compared with the mass of the electron, the 
electrons will move with much greater velocities than the nuclei, so 
that, to a first approximation at least, the motion of the electrons is the 
same as it would be if the nuclei were held fixed in space. 

This same approximation is stated quantum mechanically by the 
assumption that the eigenfunction for the whole system may be ex- 
pressed as the product of the two factors and where involves 
only the coordinates of the nuclei, while is an eigenfunction of the 
electronic coordinates found by solving Schrodinger’s equation with the 
assumption that the nuclei are held fixed in space. The coordinates 
of the nuclei would thus enter only as parameters. 

In order to test the validity of this assumption we need to see if such 
an eigenfunction can satisfy, to a good approximation, the wave equa- 
tion for the whole system. The exact Hamiltonian operator may be 
written as 

H = 8 ^M a Vl ~ ? 8^ V? + Fnn + Vne + Vet 1M 

where the first term represents the kinetic energy of the nuclei, the 
second represents the kinetic energy of the electrons, and F n „, V n „ 
and F„ are the contributions to the potential energy arising from 
nuclear, nuclear-electronic, and electronic interactions, respectively. 
If the nuclei were assumed to be fixed in space, the Hamiltonian for 
the electrons would be 

H. = -L V? + F*. + V.„ 11-2 

, m 


1AA 
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If we now represent the remaining terms in 11-1 by H„ we have 
H„ - 

a 8 irM a 
and 


H = H n + H„ 

We define as the function which satisfies the equation 

H e ^e = E e \[/ e 


11*3 
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11*5 


where E, is the electronic energy. Now if we write the wave equation 
for the complete system, assuming to be of the form iM'*, where 
\p„ is a function of nuclear coordinates only, we have 


or 


-V 

a 

Now 


Vltetn ~ E 


H = EMn 
h 2 


11*6 


Vi'l'e'Pn 


7 8ir 2 m 

+ ( V nn + F„, + V.Mn - EMn 11*7 

VlMn = + 2V„*, * V a * n + * n VU t 

ViMn = WU, 


so that 11*7 becomes 

- ? v *** - v * ■ ? dir. *' ^*) + *•? ■ zkr. v ^* 

+ *»E “ A - + (Fnn + F». + F„)M» = EMn 11*8 

i * of TYi 


If we neglect the terms in braces, this reduces to 

t ? - B§T. ’** + [? - BC ^ + < v ” + VM ’] 

~ E*. + V nn <p, = 0 11-9 

or, from 11*2 and 11*5, 

E ~ + F nn'/'n + E,* n - E+ n - 0 11*10 


which, because of 11*3, is equivalent to 

(H„ + £.)*„ = Ei(f n 11*11 

We thus see that, if our approximation is valid, the effective Hamil- 
tonian for nuclear motion is just that which would arise if we assumed 
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that the electronic energy E ei which will be a function of the inter- 
nuclear distances regarded as parameters, behaved as a part of the 
potential energy of the nuclei. For equation 11-11 to be valid, the 
terms in braces in 11*8 must be small in comparison to the term 
h 2 

— 2T T which represents the kinetic energy of the nuclei. 

d 8 7T ]\l(x 

}f/ e is usually 'only a slowly varying function of the nuclear coordinates, 
so that is much smaller than V a ^ n ; hence the approximation will 
be valid. Otherwise the neglected terms may be treated as a perturba- 
tion and will give rise to energy terms representing the interactions 
of electronic and nuclear motions. We shall postpone further con- 
sideration of equation 11*11 to Chapter XIV, devoting the remainder 
of this and the following two chapters to the solution of equation 11*5. 

lib. Molecular Orbitals; The H a + Ion. The problem of the 
electronic structure of molecules bears many resemblances to the prob- 
lem of atomic structure. Just as the eigenfunctions of atoms are 
usually built up as linear combinations of atomic orbitals, so may the 
eigenfunctions of molecules be approximated by a series of molecular 
orbitals. No method for the construction of molecular orbitals, how- 
ever, is comparable in accuracy to the Hartree method for atoms. It 
is true that the principles of the Hartree method apply equally to atoms 
and molecules, but the difficulties encountered in the numerical inte- 
grations needed to calculate the Hartree field of even the simplest 
molecules have not yet been overcome. 

The chief source of trouble in molecular problems is the absence of 
the spherical symmetry of the isolated atom. The operators M 2 
and its components, which played such an important part in our treat- 
ment of atomic structure, no longer commute with the Hamiltonian, 
and so they lose their usefulness. It is true that many molecules have 
some elements of symmetry, when the problem can be simplified with 
the aid of group theory, but these symmetry elements are properties 
of individual molecules and cannot be used in the general theory of 
molecular structure. 

In order to set up a system of molecular orbitals we are almost 
forced to use linear combinations of some set of functions in terms of 
which an arbitrary function may be expressed. Such a set might be 
the atomic orbitals of any one of the atoms composing the molecule. 
This set, however, would converge very slowly if we tried to expand 
in terms of it an orbital belonging to some other atom of the molecule. 
If we use as our set the orbitals of all the atoms of the molecule we 
should expect rather rapid convergence of our molecular orbitals. The 
value of this method can be estimated only by actual trial. 
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Let us therefore suppose that we have set up an approximate po- 
tential field V for the molecule, in which an electron is to move. This 
field might, for example, be that obtained by the superposition of the 
Hartree fields of the component atoms. Let us also suppose that we 
are given a set of functions <pi, <p2, - - - <Pn, which we shall think of as 
atomic orbitals of the various atoms of the molecule, although any set 
of independent functions could be used. The approximate orbitals 
can then be found by the method of trial eigenfunctions, using v?i, • • • <p n 
as the zero-order functions. The approximate energies of the molecu- 
lar orbitals will therefore be the roots of the secular equation 


H u - S n E ... H ln - S ln E 


= 0 11*12 


H n i — S n \E 


H n 


- S nn E 


where, if H is the one-electron Hamiltonian H = — 0 - V 2 + V. 

8 t r 2 m 


Hij = J'<p*K<p j dr 

11-13 

Sij — f <Pi<pj dr 

11-14 


Unless the atoms of the molecule are infinitely far apart, the integral 
Sij will not in general vanish, since <pi and <pj are not necessarily eigen- 
functions of the same Hamiltonian and are therefore not necessarily 
orthogonal. 

If, however, the atoms are a large distance apart, all the off-diagonal 
terms of 11-12 will vanish; those between orbitals of different atoms be- 
cause each orbital vanishes over the region where the other has a finite 
value, those on the same atom because then H is just the atomic Hamil- 
tonian in the region where the orbitals do not vanish, and the orbitals 
are eigenfunctions of the atomic Hamiltonian. If the atomic orbitals 
are normalized, equation 11- 12 is of the form 


H n -E 0 

0 H22 — E 


0 

0 


= 0 11-15 


0 


0 


••• Hnn-E 
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and the roots are E = Hu, H22, • • • H nn . In other words, the energies 
of the molecular orbitals are equal to the energies of the atomic orbitals 
if the atoms are far apart. 

If we now bring the atoms together, the roots of equation 11-12 will 
change continuously. We may thus correlate each of the n roots of 
11-12 for any configuration of the nuclei with one of the energies of 
one of the separated atoms. This does not, however, imply that each 
molecular orbital will become an atomic orbital on separation of the 
nuclei, for, if any two of the atomic orbitals have the same energy, 
the molecular orbitals whose energies approach this energy on separa- 
tion will usually go over into some linear combination of these orbitals. 
To illustrate this effect let us consider two orbitals which have the same 
energy at infinite separation. When the separation is large, equation 
11-12 has the form 


E 1 — E e 
e Ei — E 


= 0 


11-16 


where E 1 is the energy of each orbital and « is the small value of H 12 . 
The roots of this equation are E = E\ ± «. If we take E = E\ + 1 , 
the linear combination of <p\ and <P2 given by equation 7-49 is 
1 

— y= (<p 1 + <p 2 ) ; while if we take E = E x — t, the linear combination is 
V2 

—7= ( <pi — <Pi)- If we go to the other extreme and let the distances 
V2 

between the nuclei become zero, the Hamiltonian of the molecule re- 
duces to that of an atom whose nuclear charge is the sum of the nuclear 
charges of the atoms composing the molecule. The proper orbitals 
are then just the atomic orbitals of this “ united atom.” Each mo- 
lecular orbital may therefore be designated by the united atom orbital 
into which it degenerates when the nuclei are brought together, as well 
as by the atomic orbital (or combination of such) which it becomes 
when the nuclei are separated. 

As an example of this procedure let us consider the hydrogen mo- 
lecular ion H 2 + . In this molecule just one electron is moving in the 
potential field of the two nuclei. We may get a rough description of 
the lowest orbital of this molecule by considering it as a linear combina- 
tion of the Is orbitals of the two hydrogen atoms. Let us designate 
the nuclei by the letters a and b, the Is orbital of an electron in the field 
of nucleus a alone by ip a , end the Is orbital of an electron in the field 
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ids 


of nucleus b alone by ipt- Analytically (in atomic units) 



V i r 


11-17 

ib = - 7 = 
v x 

If we put 

S — J 'I'a'l'b dr Hbb = J 'PbH-4'b dr 

11-18 

/ ^aH^a dr i/ a6 = J* * 0 H* 5 dr = tf 6o 

the determination of the coefficients in the linear combination 


i + Cbib 

leads to the following secular determinant for the energy 


Haa- E 
H ab - SE 


Hab - SE 


Hbb - E 


= 0 


11-19 


11-20 


From the symmetry of the problem it is evident that H aa = H bb - Us- 
ing this relation, the roots of the determinant are found to be 


or 


Haa — E = =F ( Hab ~ SE) 


Ei = 


Haa ~b Hab 
1 + S 1 


jE?2 — 


Haa Hgb 
1 - S 


11-21 


The first root gives the following set of simultaneous equations for the 
coefficients in 11-19 


±[(2/ aa £ — Hab) ('a "l" ( Hab — SH a a) c b] = 0 11-22 

Equations 11-22 are satisfied only if c 0 — c b . In order that the eigen- 
function 11-19 be normalized, we must have 

<% + 4 + 2c a CbS = 1 


Cb - 


V2 + 2 S 


so that 
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Similarly, the second root gives 


Co = ~Cb = 


'2 - 2 S 


The wave functions and their associated energies are therefore 


fa + 'kb 

V2 + 2S 
fa ~ fb 


' 2 - 2 S 


Haa | Hub 
1 + 8 

Haa Hab 

1 - S 


The integrals S, Haa, and H a b may all be evaluated exactly. In atomic 
units the Hamiltonian is 


( zV2 + R a + R b r) 


where R is the internuclear distance in units of a 0 . Since 




where E H is the energy of the ground state of hydrogen, the matrix 
elements for the energy become 

„ _ B • 1 . . ... Cfl , 


H aa = Eft + “ *aa 

Hab = & ' 

so that the energy levels are 

Ei = Eft + 


tab } *ab 


rr. , 

-Jr/ 

^ r Mb 

J R a 


E 2 — Eft + 


1 taa ”"f" c afe 

R 1 -j- S 

1 *aa € ab 

R ~ 1 - £ 


In order to evaluate the integrals involved in 11*25 and 11*26, it is 
convenient to transform to elliptical coordinates (Appendix III) 


Ra + Rb 


Ra — Rb 


jd3 

dr a — (^ 2 — j, 2 ) dfi dv dtp 
o 

1 ^ M ^ 00 ; -1 < ^ < 1; 0<^><2ir 
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For the “ overlap ” integral S we have 

S = fMbdr =^J <>-<««+«» dr 

d 3 /»°° /»1 /»2ir 

= — J d/i J (m 2 — v 2 ) <2? J dp 

-IX <^*-1 X •"** 


11-27 


The integrals involved are special cases of the general integral (Appen- 
dix VIII). 




>~° x dx = 


n!e~° JL a* 


n+l S7.1 — -dn(o) 


r 0 fc! 


so that 5 is readily found to be 

S = e~ R ^ 


^(i + R + f) 


11-28 


11-29 


The integral t aa is 

- 2 Ra J 2 f*e~ R 


^ao 


_ 1 £ e~ ZK * _ 1 2 p 

“ t rJ R b T ~ttrJ 


dr 


30 


= ?{X / e ~ R " dv +f 1 e ~ R * dn f ^e~ Rp dv J 11 

The integrals in v are special cases of the integral 

J x n e~ ax dx = (— l) n+1 A n (— a) - A„(o) 11-31 

Inserting the proper values for the integrals gives 

«aa = |ll-e- 2fi (l+B)} H-32 


In the same way, we find for «„& : 

« o6 = e~ R (l -(- R ) 11-33 

For large values of 12 we see that 5 = 0, H aa = Bh, Hob — 0, so that 
Ei = E a = E s , that is, just the energy of a normal hydrogen atom, 

as of course it should be. For R = 0, 5 = 1, Haa = B H — 1 + — » 

Hab — Haa. Neglecting the nuclear repulsion term \ for the time 
being, we see that the electronic energies are B{ = 3B H , Bg = 0. 
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When R = 0 , the lowest molecular orbital should become the Is atomic 
orbital of He, with an energy 4-E H * Our approximation is therefore 
in error by an amount for R = 0, although it is correct for large R . 
The reason for this is clear. For large R the orbital fa is the correct 
orbital for a hydrogen atom, but for R = 0 it is again a hydrogen or- 
bital, but surrounding a nucleus whose charge is two instead of one. 
In order to get a good approximation for small R we should take a 
number of orbitals for each atom. 

Referring to 11*26, and for simplicity neglecting S as compared with 
unity, we see that the difference in energy between the two states, to 
this approximation, is just 2e a b . Also, to this approximation, state 
fa is unstable with respect to a hydrogen atom and a proton by an 
amount (1 + R){e~ R + e~ 2R ) while state fa is stable by an amount 
(1 + R){e~ R — e ~ 2R ). That this should be so may be seen qualitar 
tively in the following manner. For state \p i, the electron density is 

Pi = 'Pi'Pi = 2 _|_ 2S ^ ^ + Wo^b) 

While for state fa it is 

P 2 = g "i 2s ~ 2 tab) 


At a point midway between the two nuclei, we have 


Pi 


2 + 2 S 


fa) P2 


0 


State \pi thus has a much greater accumulation of charge between 
the two nuclei than state ^ 2 ) the attraction between this accumulation 
of charge and the two protons may be considered as producing the 
stability of state fa. 

It is of some interest to look at this problem from the following view- 
point. The wave functions, including the time-dependent term, are, 
if we neglect S as compared with unity, 

— X — t 1 .jft , 

'J'i = 'Pie * = —7= (V'a + ib)e * * 

.Et. -i 

*2 = ^~ l T = T/2 ~ ^ e ~ * * 

Any linear combination of these two solutions will represent some 
particular distribution of electron density. Let us consider tihe com- 
bination ¥ = - 7 = (’Pi + ^ 2 ). The electron density corresponding to 
V2 
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]+*$) 


this state is 

/ r , (g.-g.) , (g.-jg.) -| \ 

P = i \^i + 'Pi'1'2 le x + e * J + ^ 2 / 

Fort = 0, p = 5(^1 + fo) 2 = l , so that the electron is on nucleus o. 

_ 7T ft 

° r * = - fi 2 ’ 

p = 2W1 - 1A2) 2 = 

J 08 n 




Experimental 


Fig. 11*1. Binding energy of H 2 + as a function of the internuclear distance. 

so that the electron is on nucleus b. From this viewpoint (which should 
not be taken too literally), the electron oscillates between a and 6, 

the frequency of the oscillation being v . We thus have 
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the result A E = hv, where v is the frequency of the oscillation between 
the two states and A E is the difference in energy between these two 
states. 

In Figure 11-1 the energy levels for these two states are plotted as 
a function of the internuclear distance R, along with the experimental 
curve as determined from spectroscopic data. In Figure 11-2 the dis- 



tribution of charge along the internuclear axis is shown. It is seen 
that this approximation gives qualitatively correct results, although 
quantitatively the treatment is not very satisfactory. The results 
can be somewhat improved by taking more complicated zero-order 
functions. 

A simple method of improving the agreement would be to introduce 
a parameter into and for example, we might take 



and then vary a so that the energy is minimized. We should then get 
agreement at R = 0 as well as at R = oo , The energy may be further 
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improved by including in the secular equation the 2 p orbitals of the 
hydrogen atoms. The inclusion of these terms partly takes into ac- 
count the polarization of the hydrogen atom by the other nucleus. 

We shall see in Chapter XIV how the depth D e , the intemuclear 
distance r e , and the curvature of the energy curve at the minimum 
may be determined from spectroscopic data. For H 2 + the band 
spectra indicate the values D e = 2.791 e.v. and r e = 1.06 A. The 
simple theory as described above gives D e = 1.76 e.v. and r e = 1.32 A. 
Introduction of the parameter a improves these results to D e = 2.25 e.v. 
and r e = 1.06 A , 1 and inclusion of the 2 p orbitals gives D e = 2,71 e.v., 
r e = 1.06 A. 2 The value of D e could, of course, be improved by add- 
ing more and more hydrogen orbitals. 

The best orbitals which have been obtained were found by a different 
method of approach. This method is similar to that used in the varia- 
tional treatment of the helium atom; hydrogenlike orbitals are given 
up completely. For the H 2 + ion, the natural coordinates to use are 
the elliptical coordinates r, ip. James 3 found that a good approxi- 
mation to the lowest orbital of this molecule is 

+ = + cv 2 ) 

where 5 and c are parameters. For the observed internuclear dis- 
tance 1.06 A, the best values of the parameters give D e = 2.772 e.v., 
which is quite close to the experimental value. Even better results 
may be obtained, however, since the wave equation is separable in 
these coordinates and may be solved by numerical integration. We 
shall discuss the results in detail in the following section. 

11c. The Electronic States of the H a + Ion. In the theory of the 
electronic states of molecules, particularly of diatomic molecules, the 
simplest example, H 2 + , plays a role of importance equal to that of the 
hydrogen atom in the problem of the electronic structure of complex 
sttoms. We wish at this point, therefore, to present a general dis- 
cussion of the possible states of this molecule and its exact energy 
levels before proceeding to the description of more complicated dia- 
tomic molecules. 

The wave equation for the hydrogen molecule ion may be written as 

« 2 » . 2m/ e 2 e 2 e 2 \ 

V ^ + “tj ( -E H 1 W = 0 11-34 

n \ r a Tb r a b/ 

1 B. Finkelstein and G. Horowitz, Z. Physik , 48 , 118 (1928). 

2 B. Dickinson, J. Chem . Phys., 1 , 317 (1933). 

2 H. M. James, J. Chem. Phy8. f 3, 7 (1935). 
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We now transform to the elliptical coordinates 


r a + n Tg - r b 

r a b ’ V r ai ' V 


In this coordinate system, the Laplacian operator V 2 is (Appendix III) 


rW ~ v 2 ) X 

far.- . el . d r.: dl . v? - v 2 


Equation 11-34, in the new coordinate system, is then 


1)(1 - v 2 ) d<p‘ 


(M 2 - 1) 


0 + s[ 


+ 73 


2 2 
fl — V 


dvj (n 2 — 1)(1 — v 2 ) d(p 2 
+ { - (m 2 - v 2 )t + 2 nRW = 0 11-35 


where 


(-£> 


D rne* r ab 

R *2 — 

n a 0 


We now try to find a solution of the form 
yp = M (n)N (v)$(<p) 


Since ip enters equation 11-35 only in the term — « , it is at once appar- 

d<p 

ent that this equation can be separated into a part dependent on ip 
alone and a part dependent on n and v. We call the first separational 
parameter —X 2 , so that &{tp) satisfies the equation 

—r = — \ 2 y? 11-37 

dp 2 


Equation 11-35 is thus reduced to 


M dfJL 


, 2 dM X 2 o , 

M 2 - 1 ) — 2 7 “ m 2 « + 2R ^ 

dfi M - 1 


1 d „ dN ,, X 2 2 „ OD 

= _ — — _ v 2 ) — •+ j -ye 11-38 

N dv dv 1 — v 


We set both sides of this equation equal to — r and thus obtain the 
final differential equations for M(n) and N(v). 


0» a — l) 


i + \ m 2 -1 


IM 2 e + 2 fiR + r 


^ m = 0 11-: 
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+ (-rr?+-'«-')*-° ,1.40 

The set of equations 11-37, 11-39, 11*40 will possess satisfactory solu- 
tions only if the parameters X, r, and e have certain definite values. 
The set of simultaneous equations has been solved by Teller, 4 and, for 
the ground state, by Burrau, 6 Hylleraas, 6 and Jaffe, 7 leading to results 
in complete agreement with experiment. The solution of the y? equa- 
tion leads to the familiar result 


where X takes on positive or negative integral values. The energy 
depends on X through |x|, since only X 2 occurs in the equations which 
determine the energy parameter e . For r a b = 0, the wave equation 
is the same as that for He + , except for the difference in nuclear mass. 
The quantum number X is a “ good ” quantum number at all inter- 
nuclear distances, since the equation in <p can always be separated 
from the remainder of the wave equation. For r a & = 0, that is, for 
the united atom, X becomes equivalent to the atomic quantum num- 
ber m. In describing the molecular orbital, it is therefore character- 
ized by the designation of the state of the united atom to which it 
reduces, 2s, 2p, etc., plus the designation of the X value; the symbols 
a, 7 r, 8 * • • denote jx| = 0, 1, 2 - * • . Since, for the united atom, 
m = l, l — 1 • • • — l, the possible molecular orbitals are lscr, 2sa, 2pa, 
2pir , 3 sa, Spa, Spw, 3 da, Sdir, 3 dd - * • . The a states are non-degener- 
ate, the tt, 8, etc., states are doubly degenerate because of the equiva- 
lence of the two values d=X. In Figure 11-3 the electronic energies of 
several of the lowest states of H 2 + are plotted; in Figure 11*4 the total 
energy for the ls<r and 2 pa states are plotted as a function of the inter- 
nuclear distance. 4 It will be noted that the state Isa corresponds to 
the state {^<*(1 s) +^&( Is)} of our earlier treatment; the state 2p<r 
corresponds to {^ 0 (ls) — fo(ls)}. 

lid. Homonuclear Diatomic Molecules. At this point it is of 
value to consider, on the basis of group theory, the possible states of a 
homonuclear diatomic molecule. Such molecules belong to the sym- 
metry group Dooa; the characteristics of the possible states, including 
their degeneracies, are determined directly from the character table 

4 E. Teller, Z. Physik, 61, 458 (1930). 

6 0. Burrau, Kgl. Danske Videnskab. SeLskab 7, 1 (1927). 

e E. Hylleraas, Z, Physik , 71, 739 (1931). 

7 G. Jaffe, Z. Physik, 67, 535 (1934). 
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Fig. 11-3. Calculated electronic energy of H 2 + . 
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of the irreducible representations of this group. This character table 
is shown below. The symbols in the first column are those used to 
describe the electronic states of homonuclear diatomic molecules. 



E 

2 C v 


iE 

2 iC v 



1 

1 

1 

1 

1 

1 

22- 

1 

1 

1 

-1 

-1 

-1 

2? 

1 

1 

-1 

I 

1 

-1 

27 

1 

1 

-1 

-1 

-1 

1 

n. 

2 

2 cos (p 

0 

2 

2 cos y? 

0 


2 

2 cos <p 

0 

— 2 

—2 cos*> 

0 


2 

2 cos 2<p 

0 

2 

2 cos 2 ip 

0 

A u 

2 

2 cos 2 <p 

0 

-2 

— 2 cos 2*> 

0 


States which are invariant under rotation about the symmetry axis 
are called 2 states. If only one electron is present, these are the states 
for which X = 0; if more than one electron is present, these are the 
states for which A = XX = 0. n states are those for which A = 1; 

t 

A states those for which A = 2, etc. If only one electron is present, 
these are equivalent to the i r and h states discussed above. In addition, 
the 2 states are characterized by the designation + or — according 
to the manner in which they behave when subjected to the operation 
a v , which is a reflection in a plane in which the symmetry axis lies. All 
states are further characterized by the symbols g (“ gerade ”) and u 
(“ ungerade ”). These symbols tell whether the wave function re- 
mains invariant or changes sign upon inversion at the center of sym- 
metry. For the special case of the H 2 + ion, we readily see that <r 
orbitals give 2 + states, n orbitals give II states, etc. The g or u prop- 
erty is independent of the internuclear distance. If the molecular 
orbital is written as a linear combination of atomic orbitals, then we 
immediately see that combinations of the type \yp a + &>} are g, those 
of the type {\p a — tb) are u. For the united atom, the center of sym- 
metry becomes the atomic nucleus; a molecular orbital thus has the 
same g or u property as the atomic orbital to which it reduces. For 
a one-electron atom, the wave function is g or u according as l is even 
or odd; for a many-electron atom, the atomic wave function is g or u 

according as XX is even or odd. We thus see that a 2 sa orbital gives 

* 

a 2+ state, a 2pa orbital gives a 2* state, and a 2pir orbital gives a 
n M state. Referring to Figure 114, we note that in H 2 + the 2^ state 
is stable, the 2+ state is unstable. 

It is possible to make a unique correlation between the states of the 
united atoms and the states of the separated atom by means of the 
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theorem which says that two levels with the same symmetry properties 
cannot cross as the internuclear distance is varied. The following 
proof of this theorem is that given by Neumann and Wigner 8 as modi- 
fied by Teller . 9 Suppose that we know all the electronic wave func- 
tions except two. These two may be written as linear combinations 
of the functions fa and ^2, which have been chosen to be mutually 
orthogonal ,and orthogonal to all the remaining wave functions. The 
energy levels are given by the solutions of the equation 

H\\ — E Hu 

= 0 

H 12 H22 — E 

In order that the two roots be equal, the conditions Hu = H 2 2, #12 = 0 
must be satisfied simultaneously. If 4 *i and have different symmetry 
properties, then #12 is identically zero. Since Hu and H22 are func- 
tions of the internuclear distance, it is possible for the two to be equal 
at some distance; when this is true, the two energy levels are equal, 
and crossing can occur. If and have the same symmetry, H 12 
will not be zero. It will not, in general, be possible to satisfy the two 
conditions by varying one parameter, hence the two energy levels 
can never be equal, and crossing is impossible. 

Since X is a good quantum number at all times, and since the g and 
u properties must be preserved as the internuclear distance is varied, 
the correlation between the states of the united atom and those of 
the separated atom must be c g — * <r g , <r u — * <r u , ir g — » ir g , etc. Further, 
two <r g orbitals cannot cross; neither can two <r u orbitals, etc. In 
Figure 11 - 5 , we have on the left the various states of the united atom 
and the possible molecular orbitals into which they can split; on the 
right we have the various states of the separated atoms and the sym- 
metry properties of the molecular orbitals which can be formed by 
taking linear combinations of the atomic orbitals. The correlation 
between the two sets of states is given by the connecting lines which 
were drawn in accordance with the above rules. It will be noted that 
this schematic representation corresponds exactly (as of course it 
must) with the exact energy level diagram for H2 + (Figure 11 - 3 ). 

In determining the electron configurations of complex atoms, we 
added the electrons to hydrogenlike orbitals, placing the electrons into 
the lowest orbitals allowed by the exclusion principle. Similarly, for 

8 J. v. Neumann and E. Wigner, Phyaik. Z., 80, 467 (1929)* 

* E. Teller, J. Chem. Phya., 41 , 109 (1936). 
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complex homonuclear diatomic molecules, we add the electrons to 
the lowest allowed H 2 + -like molecular orbitals. Taking account of 
electron spin, the exclusion principle allows us to place two electrons 
in a orbitals, and four electrons in «r, S • • • orbitals, because of the 
two possible values of X in the latter. This procedure gives us the 
electron configuration of the lowest state, which will then be char- 
acterized by the values of A = XX,- and S = £s,-. States with S — 0, 

i i 

1 • • • are called singlet, doublet, triplet, • • • states, just as for atoms. 



This procedure will be perhaps clarified by several examples. For the 
simplest homonuclear diatomic molecule, H 2 + , the electron configura- 
tion is (<r„l «), the notation referring to the states of the separated atoms, 
which gives A = 0, S - |, so that the state is 2 2/. (o- orbitals can, 
of course, give only + states; rr, S • • • orbitals can lead to either + 
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or — states. If the number of electrons in u orbitals is even, the 
resulting states will be g; if the number of electrons in u orbitals is 
odd, the resulting states will be u .) For H 2 , the electron configura- 
tion is (v g l$) 2 > so that A = 0, S = 0 (because of the exclusion prin- 
ciple), and the ground state is *2*. For Li 2 , the electron configura- 
tion is (^ls) 2 (<r u l5) 2 (<r (7 25) 2 ; the corresponding state is l 2+, For 
more complex molecules, there is a possible ambiguity in the order of 
the orbitals, as this order sometimes changes as the intemuclear dis- 
tance is varied. For these molecules it is necessary to resort to ex- 
perimental evidence to determine the electron configuration of the 
ground state. For example, the electron configuration of N 2 is found 
to be (<r 0 ls) 2 (cr u ls) 2 (<rg2s) 2 (<r u 2s) 2 (Tr u 2p) 4 (<rg2p) 2 . Only closed shells 
of electrons are involved, so that the state represented by this con- 
figuration is x 2*. In 0 2 , the two additional electrons go into the 
(ir g 2p) orbital. The quantum number A can be either 0 or 2, depend- 
ing on whether or not the X’s are directed oppositely or are parallel. 
If A is 2 the spins must be opposed; if 0 they may be either parallel 
or opposed. The configuration • • • {ir g 2p) 2 thus leads to x 2, 3 2, and 
l A states. All states are, of course, g; the detailed theory 10 shows 
that they are in fact 3 27, and 1 A g . The lowest state of 0 2 is 
found experimentally to be the 3 2^ state; as a result, 0 2 is para- 
magnetic (Chapter XVII). 

An interesting interpretation of the character of the chemical bond 
in diatomic molecules can be given in terms of the above considerations. 
If an orbital maintains the same principal quantum number as the 
transition is made from the separated atoms to the united atom, it is 
said to be a “ bonding orbital.” If the principal quantum number 
increases, it is said to be an “ anti-bonding orbital,” and an electron 
in an orbital of this type is said to have been “ promoted.” As may 
be seen in Figure 11-4, occupied bonding orbitals will tend to form 
stable states; occupied anti-bonding orbitals will tend to form un- 
stable states. The difference between the number of pairs of electrons 
in bonding orbitals and the number of pairs in anti-bonding orbitals 
may be regarded as the effective number of “ electron pair ” bonds. 
For Li 2 , 0 2 , and N 2 this difference is seen to be 1, 2, and 3, respectively, 
which corresponds to the usual designation of the bonds in these mole- 
cules as single, double, and triple. 

lie. Heteronuclear Diatomic Molecules. For heteronuclear dia- 
tomic molecules we no longer have the center of symmetry which was 
present in the homonuclear diatomic molecules. Heteronuclear dia- 
tomic molecules belong to the symmetry group C«> V} for which the 

10 E. Wigner and E. Witmer, Z . Phytik , 61, 869 (1928). 
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character table is 

E 2C V <Ty 
2 + 1 1 1 

2 - 1 1 -1 

II 2 2 cos <p 0 

A 2 2 cos 2 <p 0 


We note that we have the same types of possible states as for the group 
D<x>hj except that the g and u property has been lost. In drawing *the 
correlation diagram analogous to Figure 11*5, we must take into 
account the fact that there are now, for example, two Is states for the 
separated atoms, since, if we designate the atoms by a and 6, the atomic 
orbitals ^ a (l$) and foi Is) will have different energies because of their 
different nuclear charges. Let us suppose that we again write the 
molecular orbital as a linear combination of atomic orbitals, for ex- 
ample 

* = emails) +C»h(U) 1141 

The secular equation is 

Haa - E Hab - SE 

= 0 11 42 

H a b ~ SE Hbb ~~ E 

where the symbols have their usual meaning. We now wish to inves- 
tigate the values of the coefficients c a and c&. We imagine the inter- 
nuclear distance to be sufficiently large so that the overlap integral S 
may be taken equal to zero. We further assume that a has the greater 
nuclear charge, so that H aa > Hbb • The energy eigenvalues, given 
by the solution of 11*42, are 

E = J{ (H aa + H bb ) ± V(H aa - H bb ) 2 + 4 !&} 1143 

For H aa = Hbb, the upper sign gives c a = c&; the lower sign gives 
c a = —C6. For H aa > Hbb we may therefore conclude that the upper 
sign corresponds to the case where c a and have the same sign; the 
lower sign corresponds to the case where c a and have opposite signs. 
Further, for the limiting case where H a b = 0, the upper sign gives 
E = H aa , c^ = 1, 4 = 0; the lower sign gives E = Hbb, 4 = 0, 
4 = 1. For any internuclear distance we therefore conclude that 
the proper linear combinations of atomic orbitals are 

= emails) + 4'l'b( Is) 

hi « c'Va(ls) ~ Cb'M Is) 

where 4 > 4; 4' > 4'. Using the correlations <r— xr, ir — >w, etc., 
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and noting that two a or two v orbitals cannot cross, we obtain the 
correlation diagram for heteronuclear diatomic molecules given in 
Figure 11*6, where atom a is considered to have the greater nuclear 
charge. The method of determining the electron configuration for 
the ground state is identical with that applied previously. For ex- 
ample, LiH should have the electron configuration (lscr) 2 (2s<r) 2 if we 
use the united atom notation, which leads to the ground state 1 2 + . 


3 dS 



2 s 2 sa [p Is)* (ls) 6 



United Atom Separated. Atom* 


Fig. 11-6. Correlation diagram for unlike nuclei. 

If we consider the molecule (LiH) ++ , we see that the electron con- 
figuration for the ground state is (Iso-) 2 . Upon separation of the 
nuclei, Figure 11-6 tells us that we obtain Li + and H + ; that is, both 
electrons remain on the Li nucleus, which is in accord with our con- 
clusion that Ca > <£. However, if we separate the nuclei in LiH, the 
correlation diagram states that we would obtain Li + and H - . Actu- 
ally we would obtain Li and H. These considerations suggest that 
we would obtain better molecular orbitals by the following procedure. 
We consider that the Is electrons of Li are unaffected by the formation 
of the molecule, and we form molecular orbitals, not between Li +++ 
and H 4 , but between Li + and H + . The problem then becomes quite 
similar to that of the hydrogen molecule; we take as the molecular 
orbital which leads to a stable molecule the linear combination 

rp - a^ u (2s) + 


1145 
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This orbital can, of course, be occupied by two electrons, provided 
that their spins are antiparallel. 

The above considerations have enabled us to determine the types 
of states that may arise in diatomic molecules and have shown us how 
to write approximate orbitals to describe these states. It cannot be 
expected that the use of these simple orbitals will give quantitatively 
correct results. In the next chapter we shall see what results can be 
expected in the simpler cases, and, on the basis of these results and the 
above considerations, we shall attempt to develop a satisfactory quali- 
tative theory of valence. 



CHAPTER XII 


THE COVALENT BOND 


12a. The Hydrogen Molecule. For the hydrogen molecule, the 
Hamiltonian operator, in atomic units, is 


H = 


1 

2 


n V? + r T 2 + 


Ral Ra2 Rbl Rb2 R 12 



12- 1 


where the subscripts 1 and 2 refer to the electrons and the subscripts 
a and b refer to the nuclei. Recording to the results given in the pre- 
vious chapter, J the lowest molecular orbital, expressed as a linear com- 
bination of the atomic orbitals of hydrogen, is 


tu* = ^o(ls) + &(!*) 


12*2 


We can, according to the exclusion principle, place two electrons in 
this orbital, with their spins opposed. If we designate the electrons 
by 1 and 2 , the unnormalized wave function for the ground state of 
the hydrogen molecule would then be, to this approximation, 


i - {MD +*(1)}{*.(2) +M2)} 12-3 

where 

Ml) = 4 =e "*‘ 1 ; = 4 = etc. 

Vtt V 7 T 


j* i/'Hi/' dr 

The energy E = of the ground state of hydrogen as given 

Jwdr 

by this wave function has been calculated approximately by Hell- 
mann . 1 He finds the equilibrium distance to be R ~ I. 600 ; the dis- 
sociation energy to be ~2.65 e.v. — values which are not at all in 
agreement with the experimental values R — 1.40oo; D = 4,72 e.v. 
One reason for tins disagreement may be seen immediately from the 
following argument. If we multiply out equation 12 - 3 , we obtain 

=■ lAo(l)lAo(2) + 'Aa (lj’/'s (2) + ’Af>(l)’/'a(2) + iA&(l)iA&(2) 12-4 

1 H. Hellmann, Einfuhrung in die Qnantenchemie, p. 133, Franz Deuticke, 1937. 
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rhe first and last terms in this expression represent electron density 
distributions in which both electrons are on the same hydrogen nucleus; 
that is, they represent ionic states such as H + H“. Since it is known 
that the electron affinity of hydrogen is very much less than the ioni- 
zation potential of hydrogen, we would expect that such states are not 
very stable, and hence that we might obtain a better representation 
of the ground state of the hydrogen molecule by dropping these terms. 
This leads us to the function used by Heitler and London 2 in the first 
successful attack on the problem of chemical valence. These authors 
wrote 

t-Miyhw+Miy+aW 12.5 

as the function representing the ground state of the hydrogen mole- 
cule. The energy corresponding to this function may be written as 

r- J ' + K ' , 0 . 

E - 12-6 

where 

f = f *.(Dfc(2)H*.(l)fc(2) d n dr 2 

K = S dn dr2 

s 2 = ftaWMVMVMDd^d'^ 

From the form of H and the functions ^ 0 (1), etc., it is readily seen 
that the energy may be written as 

E ~2E u (EL) + Q + a 12-7 

where 


J = 


1 + s 2 


1 + s 2 


_L + _L + JL' 

Ra2 Rl2 Rob, 


= ~2 € aa + n ^ f 

Kab ** 


Rbl Ra2 Rl2 
r {MD} 2 lM2)} 2 
' Rl2 


$a{ 1)^6 (2) dr 1 dr 2 


dr\ dr 2 


k= f ul)M 2 )\-±~-L + 1 L + -L 

J I K>al llb2 rtl2 Kab. 

s 2 na , rtaiDMVMVMD 

- ~ 2Stab + / p 

itab »/ tt 10 


^o(2)^fc(l) dri dr 2 


dr 1 dto 


2 W. Heitler and F. London, Z. Physik, 44, 465 (1927). 
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The integrals S, too, 6 a t have been evaluated in section lib. The re- 
maining integrals have been evaluated by Heitler and London 2 and 
by Sugiura. 3 The results gave for the intemuclear distance and the 
dissociation energy the values R = 1.64o 0 , D = 3.14 e.v. Although 
this value of the dissociation energy is only slightly better than that 
obtained from the strict molecular orbital treatment, the Heitler- 
London method is somewhat easier to handle than the molecular 
orbital method, and we shall make considerable use of this method of 
writing approximate wave functions in the following pages. 

It is of interest to investigate the state of the hydrogen molecule 
that arises when one electron is placed in the lsv orbital and the other 
in the 2 pa orbital. The wave function is then 

* = {*.(1) + *6(1) I {*«( 2) ~ *6(2) } 12-11 

which, if we drop the ionic terms as in the Heitler-London approxima- 
tion, may be written as 

* = *«(1)*»(2) - * a (2)*i(l) 1212 

For this state, the energy is 

E = 22? lt (H) + Q' - a 12-13 

where 



and where the remaining symbols have the same significance as before. 
The quantities Q and a are called the “ coulombic ” and “ exchange ” 
energies, respectively; the integrals J and K are called the coulombic 
and exchange integrals. The state represented by 12-12 is unstable. 
It is noted that the stability of 12-5 relative to 12-12 is due essentially 
to the difference in sign of the exchange energy, the binding energy of 
12-5 being Q + a, and of 12-12 being Q' — a. In Figure 12-1 the cou- 
lombic energy and the total energy, as calculated from the above equar 
tions, are plotted as a function of the intemuclear distance. It is 
observed that the greater part of the binding energy arises from the 
exchange term, the coulombic energy being only about 10-15 per cent 
of the total. We further note from equation 12-10 that the exchange 
integral K will be at least roughly proportional to the overlap integral 
S, so that if two orbitals overlap only slightly the exchange integral will 
be snail, and hence the binding energy of the two orbitals will be small. 
This result will later become of great importance. 

' Y. Sugiura, Z. Phyrik, 46 , 484 (1937). 
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We have as yet made no explicit mention of electron spin. To the 
approximation that we have been using, the Hamiltonian operator 
contains no terms dependent on spin, so that the spin wave functions 
and the orbital wave functions are separable. As before, we designate 
by a the spin eigenfunction which has the eigenvalue +J, and by 0 



Fra. 12*1. Calculated eoulombic and total energies of Hj as functions of the 

interatomic distance. 

the spin eigenfunction which has the eigenvalue — f. If we have two 
electrons, we can form one spin eigenfunction 

{<*(1)0(2) - « (2)0(1)} 

which is antisymmetric in the electrons, and three spin eigenfunctions 

{«(1)«(2)} 

» . ( 0 ( 1 ) 0 ( 2 )} 

{<*(1)0(2) + a(2)0(l)} 

which are symmetric in the electrons. Of the orbital wave functions 
which we have thus far considered for the hydrogen molecule, 12-5 is 
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symmetric in the electrons and 12-12 is antisymmetric in the electrons. 
According to the exclusion principle, the complete wave function for 
a system must change sign if two electrons are interchanged; in order 
to form acceptable wave functions of the above type we must there- 
fore combine a symmetrical orbital with an antisymmetrical spin func- 
tion, and conversely. We thus obtain the complete wave functions 

= {MDMV + *.(2)fc(l)| {a(l)0(2) - «(2)0(1)} 12-14 

l«(l)«(2)} 

**+ = iMWb(2) - t«(l)0(2) + «(2)/3(l)l 12-15 

L { j8(l)|8(2)} 

We note that in the stable state 1 2j~ the spins are opposed; in the 
unstable state 3 2t they are parallel. The general theory of valence 
will be based almost entirely upon the material thus far presented in 
this and the preceding chapter. Before beginning this general discus- 
sion, we shall see how the above method of approach should be modified 
in order to obtain better quantitative agreement with experiment. 

Several simple modifications of the function 12-5 have been made. 
A considerable improvement in the calculated dissociation energy is 
obtained by introducing an effective nuclear charge as a variational 
parameter, that is, writing 

/ „ 3 \H 

MD r [-) etc - 

This function, with a = 1.17, gives D — 3.76 e.v. 4 A more general 
function is obtained by writing the unnormalized function as 

M - (1 + 

where the nuclei lie along the z axis, that is, including the 2 p, hydrogen 
orbital in the variational function in order to take account of the fact 
that one hydrogen atom will polarize the other. With a = 1.17, 
ci = 0.10, a value D = 4.02 e.v. is obtained for the dissociation energy. 8 
If in addition the ionic terms in 12-4 are included, multiplied by a 
parameter C 2 , then, with a = 1.19, ci = 0.07, C 2 = 0.175, the dissocia- 
tion energy is calculated 6 to be D = 4.10 e.v. as compared with the 
experimental value D = 4.72 e.v. This small value for C 2 is further 
confirmation of the fact that, for the hydrogen molecule at least, the 

4 S. Wang, Phys. Rev., 31, 579 (1928). 

6 N. Rosen, Phys. Rev., 88, 2099 (1931). 

• 8. Weinbaum, J. Chem. Phys., 1, 317 (1933). 
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Heitler-London or “ valence-bond ” method is superior to the method 
of molecular orbitals. 

As for H 2 + , it is again found that an accurate value of the binding 
energy can be obtained only by a method of approach which is not 
based upon the use of one-electron approximate wave functions. 
James and Coolidge 7 used a variation function which was written as 
a function of elliptic coordinates and which included the interelectron 
distances R\ 2 explicitly. These authors investigated the function 

* = e- J ^> E c klmnp M^ P + /wWu P ! 12-16 

klmnp 


where 


Ml = 


Ra l + Rbl 
Rab 


M2 = 


Ra 2 + Rb2 
Rab 


V\ 


R , 


a 1 


Rbl 


Rab 


Vo ~ 


Rg 2 Rb2 

Rab 


2R 12 

Rab 


The form of the function is such that it is symmetric in regard to inter- 
change of electrons. In order that it may also be symmetric in the 
coordinates of the nuclei, only those terms which have (m + n) an 
even interger were included; the indices were taken to be positive 
integers or zero. With 6 = 0.75, the calculations, for a thirteen- 
term function, gave R a b = 1.40 a 0 and D = 4.698 e.v., in essentially 
complete agreement with experiment. The accuracy could no doubt 
be further improved by the inclusion of additional terms. We have 
had several examples (He, H 2 + , H 2 ) of the exact results which can be 
obtained by means of the variational method with a wisely chosen 
variation function. However, the labor involved in these calculations 
is so great even for these simple systems that it does not appear to be 
a profitable method of attack on molecular problems in general. Be- 
cause of the mathematical difficulties involved, we are forced to use 
much less accurate approximations; usually we are forced to write the 
wave function as some linear combination of one-electron wave func- 
tions. Although these will not give satisfactory quantitative results, 
they should in general be qualitatively correct, and should enable us 
to correlate experimental chemical facts. It is to be noted that the 
James and Coolidge treatment of H 2 contains nothing corresponding 
to the separation of the binding energy into a coulombic part and an 
exchange- part which appeared in the Heitler-London treatment. 
This separation is a mathematical result of the use of one-electron 
orbitals in forming the valence-bond function. Thus, though we shall 

7 H. James and A. Coolidge, J. Chem . Phys. f 1, 825 (1933). 
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continually use the terms “ coulombic energy ” and “ exchange en- 
ergy/ J the reader should remember that these terms have more of a 
mathematical than a physical significance. It might be added, how- 
ever, that in principle any wave function can be written in the form 

9 

where P ¥ is the operator which interchanges the subscripts of the sets 
of quantum numbers a}, etc., this form of function being that which 
has the proper symmetry in regard to the interchange of like particles. 
The associated energy will then be a sum of integrals. Certain of these 
integrals will be of the type 

f. f(a i <4, • * * a”)H/(< o|, • • • a”) 

which may be called coulombic integrals; others will be of the type 

f. f(a\, <4, • • ' «C)H f(al, a 2 !,--- a”), etc. 

which may be called exchange integrals. 

12b. The Covalent or Electron-Pair Bond. 8 According to the 
Heitler-London theory, which gives a satisfactory qualitative descrip- 
tion of the hydrogen molecule, the covalent bond in this molecule is 
represented by the orbital wave function 

{MVMV +MVMI)} 12*17 

where the functions &*(!), etc., have the significance previously stated. 
This orbital wave function must be combined with the spin wave 
function 

{<*(1)0(2) - <*(2)0(1)} 

representing oppositely directed spins, in order that the complete 
wave function be antisymmetric in the electrons. Parallel spins lead, 
as we have seen, to unstable states. The valence bond in LiH, ac- 
cording to the discussion in the last chapter, will be represented, in 
the molecular orbital method, by the function 

iatui 1) + ^H(DHafe(2) + ty h (2)} 1248 

or, in the Heitler-London method, by the function 

{fc( 1 )^h( 2) + ^Li(2)^H(l)} 12-19 

8 The methods employed in this section are essentially those employed by Pauling. 
See L. Pauling, Nature of the Chemical Bond , Cornell University Press, 1940. 
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where ^Li is the 2s wave function of Li and is the Is wave function 
of H. The mathematical treatment follows the same lines as for H 2 ; 
the function 12*19, which must be combined with an antisymmetric 
spin function representing oppositely directed spins, leads to a stable 
molecule. 

The Heitler-London method is obviously not limited to the treat- 
ment of the formation of the bonds in diatomic molecules. In CH 4 , 
for example, if we let fai, ^ 02 , ^C 3 > be the orbitals occupied by 
four of the electrons of the carbon atom, and ^ H i> ^H 2 , ^h 3 , ^H 4 be 
the Is orbitals of the four hydrogen atoms, then the set of functions 

I*ciU)*hi( 2) +*ci(2)ftii(l)} 

{^C2(3)^H2(4) + fe(4)^H2(3)} 

ffc)(5)vW6) + fe(6)*H3(5)} 12,20 

{*C4(7)*H4(8) + *C4(8)*H4(7)} 

would represent four covalent bonds formed by eight electrons. With- 
out at this time further specifying the nature of the four carbon orbitals, 
we can state that they will be orbitals with the principal quantum 
number n equal to 2. The remaining two electrons of carbon will be 
in the Is orbital and will have their spins paired; as each of the four 
valence electrons has its spin paired with an electron from a hydrogen 
atom, we see that all electron spins in the molecule are paired, so that 
the resultant spin S of the molecule is zero. Similarly in LiH and 
in H 2 all the electron spins are paired. The experimental fact that 
stable molecules (with a very few exceptions) are non-paramagnetic 
(see Chapter XVII) is a confirmation of the above-derived results that 
the resultant spin should be zero for molecules in their ground states. 

An electron which has its spin paired with another electron from 
the same atom obviously cannot take part in the formation of a co- 
valent bond, so that the covalency of an atom is equal to the number 
of electrons with unpaired spins possessed by the atom. The results 
of this rule have been mentioned in Chapter IX; we shall not repeat 
that discussion at this point but shall consider a number of examples 
in greater detail. 

The electron configuration of the ground state of the nitrogen atom 
is (l$) 2 (2s) 2 (2p) 3 ; the lowest term is 4 S, indicating that the spins 
of the three 2 p electrons are parallel. Nitrogen should thus have a 
covalence of 3, in accordance with experimental facts. The three 
valence orbitals may be written as ^ 2Px) $ 2 p v , fap/, if we were to carry 
out actual numerical calculations, we would use, for example, Slater- 
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type eigenfunctions with the effective nuclear charge appropriate for 
the 2 p orbitals of nitrogen. If we wished to describe the valence 
bonds in NH3, then, according to the Heitler-London theory, we would 
write 

{*2p,( l)fe(2) + ^2 p x (2)^h(1)} 12-21 

with analogous expressions for the two remaining bonds. According 
to the results of the similar calculation for H 2 , the energy of the bond 
represented by 12*21 is determined largely by the value of the exchange 
integral between \p 2Px and ^h* This exchange integral is, as we have 
seen, proportional to the overlap integral between \f/ 2Px and Now 
^2 p x has its maximum value along the x axis. Consequently the over- 
lap integral and hence the exchange integral will have their maximum 
values when the hydrogen atom lies along the x axis; that is, this 
particular N — H bond will be stronger when the hydrogen atom is on 
the x axis. The same argument can be carried out for the other bonds; 
the stable configuration for the ammonia molecule is thus that in which 
the three hydrogen atoms lie along the x, y, and z axes, respectively. 
Since the three bonds are equivalent except for orientation in space, 
this simple theory of directed valence predicts that the ammonia mole- 
cule should be a triangular pyramid, with all H — N — H angles equal 
to 90°. The experimental value is about 108°; the manner in which 
the simple theory must be modified will be discussed later. 

The oxygen atom has the electron configuration (l$) 2 (2$) 2 (2p) 4 
the lowest term being 3 P. Since there are four electrons to be placed 
in the three 2 p orbitals, one of the orbitals, which we may take to be the 
2 p z orbital, must be occupied by two electrons with their spins paired. 
The two valence orbitals are then ^ 2Px and \po Vy ; the arguments used 
above for NH 3 lead us to expect that the H — O — H angle in H 2 0 would 
be 90°. The actual angle is 105°, indicating that the simple picture 
of the oxygen valence also requires some modification. 

The description of the covalent bonds formed by the carbon atom 
is less simple than the description given above for nitrogen and oxygen, 
and it requires a new concept: that of the formation of valence orbitals 
by the “ hybridization ” of the simple atomic orbitals. The lowest 
electron configuration of carbon is (ls) 2 (2s) 2 (2p) 2 ; the lowest term 
is 3 P. If this electron configuration represented the valence state 
of carbon, it would be divalent, with the spatial distribution of the 
bonds similar to that of oxygen, in complete contradiction to experi- 
mental facts. However, the first excited electron configuration of 
carbon is (ls) 2 (2$)(2p) 3 ; the lowest term arising from this electron 
configuration is 5 S; carbon in this state had four electrons with un- 
paired spins and hence has a valence of 4. This excited state, arising 
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from the “ promotion ” of a 2s electron into the 2 p level, has not been 
definitely located experimentally, but according to calculations by the 
Hartree method, as well as certain indirect experimental evidence, 
the 5 S term lies about 3-4 e.v. above the ground state. 9 There is 
probably no very close correlation between this energy difference in 
the carbon atom itself and the energy required to “ promote ” a 2s 
electron when a compound such as CH4 is formed, since the perturba- 
tion of the energy levels of carbon by the hydrogen atoms will be ex- 
ceedingly large. This promotional energy would be more than com- 
pensated for by the energy furnished during the formation of the two 
additional covalent bonds. It may therefore be regarded as a reason- 
able assumption that quadricovalent carbon is represented by the 
electronic state (ls) 2 (2s)(2p) 3 , 5 S. 

The four valence orbitals of carbon could then be written as 
fc p a y fopy, fop,. However, we need not limit ourselves to this particu- 
lar set of four valence orbitals but may use linear combinations of 
them. We will therefore take as our valence orbitals the set of linear 
combinations ^1, ^2, ^3, ^4, the combinations being formed in such a 
way that they fulfill the necessary requirements of being normalized 
and mutually orthogonal. In addition, we will require the four va- 
lence _orbitals to have the maximum possible bond-forming power. 
According to our previous discussion regarding exchange and overlap 
integrals, this means that we want jfor example, to be that linear 
combination of J / 2 Px , 'p2 Py , fap, which has the largest possible 
magnitude along some arbitrary direction in space, subject, of course, 
toJEhe condition tba_tit be normalized. The functions ^2 Py , 

fap. may be written 

fas = #2<r(r) 

^2 P , = R2p(r) sin e cos <p 
fcpy = #2p(r) sin 6 sin <p 
$2 Pt = R2p(r) cos 0 

The radial functions / 2 2 «(r) and jR 2p (r) will not differ greatly; if we 
use Slater-type eigenfunctions the two are identical. Assuming the 
radial functions to be identical, and writing only the angular parts of 
the wave functions, we have 

*2. = 1 ** = V 3 sin 0 sin ^ 

_ T 12*23 

fcp, — *3 sin 9 cos <p fa p , = V 3 cos 6 

where the angular parts of the wave functions are normalized to 4ir. 
•aW. Ufford, Phys. Rev., 53, 668 (1938). 
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In accordance with the result that the strength of the bond formed 
by two valence orbitals will be proportional to the overlap of the two 
orbitals, Pauling 10 has defined jbhe bond-forming strengths of the above 
orbitals to be 1 for fca and V3 for >*> and these being the 
maximum values possessed by the angular parts of the wave functions. 

We now write 


ypi = <pp2a + + <^2 p v + dfap, 12-24 


where the possible values of the coeffi- 
cients are restricted by the normalization 
condition a 2 + b 2 + c 2 + d 2 = 1. The di- 
rection in space in which this first orbital 
has its maximum value is arbitrary. We 
7 choose this direction to be along the (1, 1, 1) 
diagonal of a cube with the carbon atom 
at its center and with the x, y , and z axes 
parallel to edges of the cube, as illustrated 
in Figure 12-2. This choice of direction 
Fig. 12*2. Coordinate system requires the coefficients b, c, and d to be 
for tetrahedral carbon orbitals, equal, so that \pi may be written as 



== a^2« + b(\[/2p x + ^2p v + ^2p,) 


12-25 


with the normalization condition a 2 + 3b 2 = 1. Along the (1, 1, 1) 

diagonal, sin tp = cos <p = -7= > cos 0 = —7^ , sin 9 = -—i , so that 

V2 V 3 V3 

^2 p a = = $ 2 Pt = 1 along this diagonal. The bond-forming strength 

of fa is t hus (a + 3b), or, by use of the normalization condition 
(a + vWl — a 2 ). The condition that the bond strength be a maxi- 
mum gives us the relation 

4 - (0 + V3V1 - a 2 ) = 0 

da 


so that a — \ . The condition o 2 + 3b 2 = 1 gives 6 = 5; the valence 
orbital fa is thus 

fa = h(fa, + fap z + fa Py + fa P.) 12-26 

This valence orbital, which has the bond-forming strength 2.00, has the 
maximum strength which can be obtained from any linear combina- 
tion of 2s and 2p orbitals. 

10 See page 78 of reference 8. 
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If we form the set of valence orbitals 

*<1, 1, 1) = $(*. + *2 p, + 4'2p y + *2V.) 

iK 1 ) ~1; -1) - \{p2» + h p x — fapy — $2 p.) 12 27 

V'C Ij 1» 1) = 5(^2« — + 'Pipy ~ fop.) 

H~ 1, -1, 1) = 5(^28 - i>2p x — i2p y + p2 p.) 

we note that these functions are normalized and mutually orthogonal, 
each function having a bond-forming strength of 2.00 (the maximum 
possible value for 2s and 2 p functions), with the maximum density 
along the diagonal of the cube indicated. All members of the set are 
equivalent except for orientation in space. Each member of the set 
has its maximum value along one of the lines from the carbon atom to 
a comer of a regular tetrahedron with the carbon atom at its center. 
This set of orbitals, determined by the condition that their bond- 
forming power should be a maximum, thus gives a description of the 
valence of carbon in complete accord with the experimental facts. 

It seems probable that promotion of a 2s electron also takes place 
to a certain extent in nitrogen and oxygen. Since the strongest pos- 
sible bonds that can be formed from a combination of 2s and 2 p orbitals 
are the tetrahedral type with the coefficient of fo 8 equal to f, we 
would expect sufficient promotion in nitrogen and oxygen to make the 
bond angles tetrahedral in NH 3 and H 2 0, provided that bond strength 
were the only criterion for the type of bond formed. However, the 
necessary promotional energy increases as we go along the series, 
C, N, 0; in addition, promotion does not increase the valence of nitro- 
gen and oxygen. The experimental facts indicate that a compromise' 
is reached in N and 0; the angles are slightly less than tetrahedral, 
showing that promotion takes place to a lesser extent than would be 
necessary to give pure tetrahedral bonds. 

Actual calculations made with these valence bond functions cannot 
be expected to lead to accurate values of bond energies, nor could they 
be expected to lead to any results not given by the above qualitative 
procedure. The above procedure must be looked upon as a quantum- 
mechanical description of covalent bond formation based largely on a 
previous knowledge of experimental fact. In the next chapter we will 
find that it is possible to treat many problems in a more quantitative 
manner; we will here largely limit ourselves to this descriptive method. 

The carbon-carbon double bond, such as that in ethylene, may be 
described as follows. From fo 8y and two of the 2p orbitals, which we 
may take to be fo Pa and fo Py> we form three equivalent bond orbitals, 
which will have their maximum values in the xy plane and will be 
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separated by angles of 120°. Taking one bond direction to be the x 
direction, these bond orbitals are 


'Pi = 


vi* u + 


V2 

vi* 2p ‘ 




12-28 


* 8 ~ Vs 4 ' 2 ’ VI* 2 *' V2 * 2Pv 


The remaining carbon valence orbital is fa — ^ 2 p K - Using the first 
three valence orbitals, the carbon atom can form two C — H bonds 
and a single C — C bond. With the fourth orbital, an additional 
carbon-carbon bond can be formed with the corresponding orbital of 
the second carbon atom. The bond formed by orbitals of the type 
is called a a bond; that formed by orbitals of the type ^4 is called 
a t bond. Let us denote the coordinate system of the first carbon 
atom by x } y, z; that of the second, by x\ y'> z\ The strongest tt 
bond will be formed when the relative orientation of the two carbon 
atoms is such that fop, and ^2 p g > overlap as much as possible, that is, 
when z and z f are parallel. The stable configuration should thus be a 
planar configuration, with the H — C — C and H — C — H angles equal 



Fig. 12-3. Stable configuration of C 2 H 4 . 


to 120°, as illustrated in Figure 12*3 (the z directions are upward, per- 
pendicular to the plane of the paper). This is in agreement with the 
experimental facts for ethylene. It also gives a reason for the rigidity 
of the double bond; if we were to rotate one end of the molecule rela- 
tive to the other, we would have to supply energy to compensate for 
the weakening of the t bond caused by this rotation. Since the carbon- 
carbon <r bond is not equivalent to the C — H bonds, we would not 
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have been required to make ^2 and ^3 equivalent to yp\) in other words, 
the H — C — C angle could have been made unequal to the H — C — H 
angle. The angles in ethylene are as indicated; these angles may not 
persist in substituted ethylenes. 

In acetylene, we may describe the carbon-carbon triple bond as 
consisting of one a bond and two t bonds of the type given above 
This valence form would give acetylene the known linear configura- 
tion. Since the 2p orbitals in a t bond overlap less than the orbitals 
in a bond, we expect a tt bond to be weaker than a <j bond, and hence 
expect a C=C bond to be less than twice as strong as a C — C bond. 
The experimentally observed bond strengths are C — G, 59 kcal.; 
C=C, 100 kcal.; C=C, 123 kcal., in agreement with this expectation. 

When we come to second-row atoms, there are available for the 
formation of valence orbitals not only the 3s and 3 p orbitals, but also 
the 3d orbitals, five in number. It is thus possible to have a covalence 
greater than 4. Rather than discuss the many individual cases by 
the methods used above, we shall later present the general theory of 
directed valence and show how it is possible to predict, by application 
of group theory, the types of bonds formed by any combination of s, 
p, and d orbitals. First, however, it is of interest to see to what extent 
the above qualitative conclusions can be justified by approximate 
calculations; as an example, we consider the H 2 0 molecule. 

12c. The Quantitative Treatment of H a 0. n In the valence bond 
method, the valence orbitals of the oxygen atom may be taken to be 
^2p*(0) and ^ 2 Pv { O), those of the 
hydrogen atoms to be 
We first investigate the energy of 
the valence structure shown in 
Figure 12-4, where <p is the angle 
between the 0 — H bond and the 
x axis. For a particular valence 
structure, the energy may be 
written as 

E = Q + a 12-29 Fig. 12-4. 

where Q is the coulombic and a is the exchange energy. According to 
equation 13-30 of the next chapter, the exchange energy for this case 
will be 

a = Kis + K 24 — 2 + K 23 +--K12 + -K34) 12-30 

where the K f s are integrals of the type 12*10. The oxygen 2p x orbital 

11 J.Wan Vleck and A. Sherman, Rev. Modem Phys., 7, 200 (1935). 
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can be expressed as 

fa Px — fa pa COS tp + fa pw sin ip 12-31 

where fa pa is an oxygen 2p orbital with its axis along the O — H bond 
and fapr is an oxygen 2 p orbital with its axis perpendicular to the bond 
direction. We may therefore write A '13 as 

*13 = / fa Px (l)fa,(mfa Px (2)fa.a) dn dr 2 . 

= Kffff cos 2 <p + K xir sin 2 <p + 2 K xa sin <p cos <p 12-32 

where 

Kaa = y*^ 2 p<r(l)^l a ( 2 )H^ 2 p<r( 2 )^i*(l) dr i dr 2 

with similar expressions for the other exchange integrals between 
oxygen and hydrogen orbitals. The integral K% 2 is independent of 
<p. If we neglect the hydrogen-hydrogen exchange energy, the part of 
the exchange energy that depends on the angle <p will be 

Kv a ( 2 cos 2 <p — sin 2 <p) + K rr ( 2 sin 2 <p — cos 2 <p) 

+ 2 Km cos (p sin <p 12-33 

The integrals K xa are of the form 

J R 2 P (ri)Ru(r 2 ) 1 ^ 2 p(r 2 )Ru(ri)x'z' dr x dr 2 

where the z ' axis is along the 0 — H bond direction and R u is the radial 
part of \p Uy etc. The integrals K xa are therefore zero, since they in- 
volve odd powers of x The orbitals \p 2 V r and are orthogonal; 
the integral K xx thus contains only the electron repulsion term, so that 
this integral is positive. The orbitals \l/ 2p<r and $ u are not orthogonal; 
the integral K ffff is therefore similar to the analogous integral in the H 2 
problem and is. negative (corresponding to attraction). The value of 
<p which gives a the greatest negative value is therefore 90°. On the 
basis of the exchange energy alone, and without considering the effect 
of hydrogen-hydrogen repulsions, this calculation gives the same 
results as our previous qualitative investigation. The coulombic 
energy for this structure is 

Q = j 13 + J 24 + J 14 + ^23 + J 12 + Jz 4 12*34 

J 12 and J 34 being neglected, the coulombic energy becomes 


Q — 2 J ffff *f* 2 J ff 
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where 

M lW'i.(2)BV apr (l)^ 1 .(2) dr, dr 2 

The coulombic energy is thus independent of (p , so that only the ex- 
change energy is effective in giving directional properties to the va- 
lence bonds in H 2 O. Inclusion of the hydrogen repulsions would, of 
course, tend to increase the bond angle, as would the use of s-p hybrid 
orbitals for the oxygen atom. 

12d. The General Theory of Directed Valence. 12 In discussing the 
valence of the carbon atom, we found that it was possible to construct 
four equivalent valence orbitals from the four atomic orbitals fo 8 , 
fa Ps , fa p v j fop,, these valence orbitals having their maximum values 
in the directions of the corners of a regular tetrahedron. Similarly, 
from the orbitals fo a , fo Px , fo Vy we found that it was possible to construct 
three equivalent valence orbitals in the xy plane, separated by angles 
of 120°. We now wish to investigate the possible sets of equivalent 
valence orbitals which can be formed from any combination of $, p, 
and d atomic orbitals, that is, from any combination of some or all of 
the atomic orbitals' s, p x , p y , p z , d z % , d X2 , d vz , d xu , d x ^ y 2 . These sets of 
equivalent valence orbitals can be most easily found by means of group 
theory. Any set of such orbitals has a characteristic symmetry group. 
The set will form a basis for a representation of the group which will 
in general be reducible, but which can be expressed in terms of the 
irreducible representations of the group by means of the group char- 
acter table. The s , p, and d orbitals of the atom will also form repre- 
sentations of the group. By comparing the component irreducible 
representations of the set of valence orbitals with those of the atomic 
orbitals it will be possible to tell which combination of atomic orbitals 
will lead to a set of valence orbitals of the required symmetry. The 
general procedure can be best described by an example; we will discuss 
the possibilities of forming three equivalent coplanar bonds, separated 
by angles of 120°. 

This set of valence orbitals has the symmetry Dsh; the character 
table for this group is reproduced in Table 12*1. The table also in- 
cludes the transformation properties of the coordinates and the perti- 
nent combinations of the coordinates. The atomic orbitals are func- 
tions of r multiplied by the function written as subscript. Hence 
they transform under the operations of the group in the same way 
as their subscripts. The atomic orbitals therefore form bases for 
representations of the group as given in the table. It is now necessary 

12 G. Kimball, J. Ckem. Phys., 8 , 188 (1940). J. H. Van Vleck and A. Sherman, 
Rev . Modem Phys,, 7, 174 (1935). 
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to investigate the transformation properties of the set of valence 
orbitals. These are represented schematically in Figure 12-5. Apply- 
ing the operation E leaves the three orbitals unchanged; the char- 
acter of the representation for the operation E } applied to this set of 
orbitals which we denote by <r , is consequently 3. The operation <r *, 
reflection in the xy plane, likewise leaves the orbitals unchanged; the 
character for this operation is thus 3. The operation C 3 , a rotation 


Y 



Fig. 12-5. Valence orbitals with symmetry Dw 


by 120° about the z axis, changes the position of all orbitals, and thus 
has the character zero. The operation S 3 , a rotation by 120° fol- 
lowed by a reflection in the xy plane, likewise has the character zero. 
The operation C 2) a rotation by 180° about an axis which we may take 
to be the x axis, interchanges orbitals 2 and 3 and leaves orbital 1 un- 
changed, and hence has the character 1. Similarly, the operation 
<r„, which we may take to be a reflection in the xz plane, is seen to 
have the character 1. The character of the representation <r is given 
in the table. Either by inspection or by applying equation 10*31, we 
see that this representation may be broken up into the sum of irre- 
ducible representations 

<7 = A[ + E* 12*35 

The set of valence orbitals a can therefore be made up of any of the 
combinations 

a = s + p x + p y 

’ " * + f ’ t P / 12-36 

(7 S "T* d>xy “T“ (*x *■— y® 

<T = d, -f- dxy d x l—y l 

that is, it is possible to form the set of three equivalent valence or- 
bitals from any of the electron configurations sp 2 , sd 2 , dp 2 , d 3 . 
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Character Table for Trigonal Orbitals 





E 

<Th 

2C$ 

2Ss 

3 C' 2 

3 <T V 

8, d z 2 

1, x 2 + y 2 , z 2 

Ai 

1 

1 

1 

1 

1 

1 



A 2 

1 

1 

1 

1 

-1 

-1 



A'/ 

1 

-1 

1 

-1 

1 

-1 

Pm 

z 

Ai' 

1 

-1 

1 

-1 

-1 

1 

/ Px> Vv 1 
[dgy, d X *—y*J 

(x 2 - y 2 , xy) ( x , y) 

E' 

2 

2 

-1 

-1 

0 

0 

dgZy dyg 

(xz, yz) 

E" 

2 

—2 

-1 

1 

0 

0 



a 

3 

3 

0 

0 

1 

1 



7 r 

6 

0 

0 

0 

-2 

0 


The possibility of double bond formation can be discussed in a 
similar fashion. In describing the valence bonds in ethylene, we stated 
that a double bond can be considered as a <r bond, with valence orbitals of 
the type discussed above, plus a t bond formed by the interaction of two 
p electrons in orbitals which have their axis perpendicular to that of the 


Y 



Fig. 12-6. Double bond formation for group D#*. 

cr bond. In a polyatomic molecule, consisting of a central atom with a 
number of external atoms bound to it, bonds of this type can also be 
formed; as far as the external atoms are concerned, the condition for 
the formation of w bonds is the presence of p orbitals at right angles 
to the bond axes. Since there can be two such p orbitals per external 
atom, the configuration of these p orbitals can be represented by the 
arrows in Figure 12-6, where orbitals 1, 2, 3 are in the xy plane and 
orbitals 4, 5, 6 are parallel to the z axis. Designating, this set of or- 
bitals by the symbol ir, we proceed to find the character of the v repre- 
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sentation in the same manner as for the o representation. We have: 

E: all orbitals unchanged; character 6. 

<j\\ 1, 2, 3 unchanged; 4, 5, 6 changed in sign; character 0. 

Cz- all orbitals changed in position; character 0. 

Ss- all orbitals changed in position; character 0. 

Cz (considered as rotation about the x axis) : orbitals 2, 3, 5, 6 changed 
in position; orbitals 1 and 4 changed in sign; character —2. 

<r, (considered as reflection in the xz plane) : orbitals 2, 3, 5, 6 changed 
in position; orbital 4 unchanged; orbital 1 changed in sign; character 0. 

The character of the x representation is given in the table. 

Breaking the x representation down into its irreducible components, 
we have 

x — A-z -f- A'z' -f- E r -f- E" 12-37 

In order that x bonds may be formed, the central atom must have 
available orbitals which transform in the same manner as the p or- 
bitals of the external atoms, so that any orbital of the central atom 
which belongs to one of the irreducible representations in 12-37 may 
be used in double bond formation. Consequently the orbitals avail- 
able for the formation of x bonds are the p orbital belonging to A'z 
(p,), the d orbitals belonging to E" (d xt) d vz ), and the two orbitals be- 
longing to E' which are not used in forming the original a bonds. The 
original a orbitals probably contain a mixture of both the p and the 
d, E' orbitals, so that the x orbitals formed from these orbitals are 
probably weaker than those formed from the A'z and E" orbitals. 
We may divide x bonds into two classes, calling them “ strong ” if 
they belong to representations not used in a bond formation, and 
“ weak ” otherwise. 

The results of such calculations for a central atom surrounded by 
two to eight external atoms are shown in Table 12-2. The columns 
give the coordination number, the configuration of electrons used in 
the formation of <r orbitals, the arrangement of the bonds obtained 
from this electron configuration, the orbitals available for the formation 
of the strong and weak x orbitals, in this order. Where there is a choice 
of two or more orbitals when only one can be used, the orbitals are 
enclosed in parentheses. 

It should be noted that this method does not predict directly the 
type of bond arrangement formed from any given electron configura- 
tion; it merely tells whether or not a given arrangement is possible. 
If, as often happens, it is found that several arrangements of the bonds 
are possible for a single configuration of electrons, the relative sta- 
bility of the various arrangements must be decided by other methods, 
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such as Pauling’s strength criterion discussed earlier in this chapter, 
or considerations of the repulsions between non-bonded atoms. 

TABLE 12-2 

Stable Bond Arrangements and Multiple Bond Possibilities 


I 

II 

III 

IV 

V 

2 

8p^ 

Linear 

p 2 d 2 

— 


dp 

Linear 

p 2 Z r 

— 


p 2 

Angular 

d(pd) 

d(sd) 


ds 

Angular 

d{pd) 

p(pd) 


d 2 

Angular 

d{pd) 

p(spd) 

3 

sp 2 

Trigonal plane 

pd 2 

d 2 


dp 2 

Trigonal plane 

pd % 

d 2 


ds 2 

Trigonal plane 

pd 2 

V 2 


d 3 

Trigonal plane 

pd 2 

V 2 


dsp 

Unsymmetrical plane 

pd 2 

{pd)d 


p s 

Trigonal pyramid 

— 

C Bd)d 4 


d 2 p 

Trigonal pyramid 

— 

{ad)p 2 d 2 

4 

sp 3 

Tetrahedral 

d 2 

d 3 


d 3 s 

Tetrahedral 

d 2 

P 8 


dsp 2 

Tetragonal plane 

d 3 p 

— 


dV 

Tetragonal plane 

d 3 p 

— 


d 2 sp 

"dp 3 * 

Irregular tetrahedron 

— 



Irregular tetrahedron 

— 

8 


d 3 p 

Irregular tetrahedron 


8 


d 4 

Tetragonal pyramid 

d 

(sp)p 

5 

dsp 8 

Bipyramid 

d 2 

d 2 


d 3 sp 

Bipyramid 

d 2 

P 2 


d 2 sp 2 

Tetragonal pyramid 

d 

pd 2 


d 4 s 

Tetragonal pyramid 

d 

P 8 


d 2 p 3 

Tetragonal pyramid 
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CHAPTER XIII 


RESONANCE AND THE STRUCTURE OF 
COMPLEX MOLECULES 


13a. Spin Theory and Bond Eigenfunctions. We will now develop 
the methods which will enable us to give an approximate quantitative 
treatment of many-electron molecules. Because of the complexity of 
the problem, we again use products of one-electron eigenfunctions as the 
basis for a perturbation calculation. For a system of n atoms, each 
with one valence electron, we denote the eigenfunctions of the valence 
orbitals of the atoms by a(x, y, z ), b(x, y, z), c(x, y, z) ■ • • n(x, y, z). 
Let the coordinates of the ith electron be (x,-, y if z t ). Then a possible 
eigenfunction for the n electrons, with spin neglected, will be 
a(l)6(2)c(3) • • • n(n), where a(x i, y if «i) has been abbreviated to 
a(l), etc. Any function which may be obtained from this by permu- 
tation of the numbers 1, 2 • • • n is an equally good eigenfunction. To 
be complete, each of the above orbital eigenfunctions must be multiplied 
by a spin function of the type a(l)a(2)/3(3) • • • a{n). According to the 
exclusion principle, only those functions are allowed which are anti- 
symmetric with regard to electron interchange. As for the atom, our 
zero-order eigenfunctions are therefore the linear combinations 


<p = 


1 

Vn\ 


(aa)i (6a) 1 (a 3)t ■ • • (no) i 
(oa) 2 (ba) 2 (cp) 2 • • • (na) 2 


131 


I (aa) n ( ba) n (c/3)„ • • • (na) n I 

where (aa)i = o(l)a(l), etc. There are 2" such determinants, since 
each column may contain either a or 0. We will denote the above 
eigenfunction by 

la b c n 

v = 1 

a j3 • • • a 

The energy, to the first order, is given by the roots of the secular equation 

| Hij - SijE\ m 0 

232 
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where 



Si, = 



For the n-electron problem, the order of this equation is 2 n , so that the 
problem will be tractable only if we can break the secular determinant 
down into a product of determinants of lower order. In the atomic 
problem this was done by means of the operators M 2 , M*, S 2 , S*, all of 
which commuted with the Hamiltonian. Owing to the lack of spherical 
symmetry of most molecular systems, the operators M 2 and M z will no 
longer commute with the Hamiltonian, and so they lose their usefulness. 
To the approximation we are considering, in which spin interactions are 
neglected, the operators S 2 and S z commute with the Hamiltonian and 
may be used to reduce the order of the secular determinant. Each of 
the eigenfunctions <p is already an eigenfunction of S*, since each term 
in the expansion of the determinant of <p is an eigenfunction of S z with 
the same eigenvalue. The eigenvalue of any <p for S z is found from the 
relation 

S* = (n a - np) — (p 


where n a is the number of columns of ct s and is the number of columns 
jft’s. For n = 6, we have 2 6 = 64 eigenfunctions <p. Classifying these 
according to their eigenvalues of S z , we have: 

Eigenvalue of S, ^units of 32 1 0 —1—2—3 

Number of eigenfunctions 1 6 15 20 15 6 1 

Since Hi, = Si, = 0 if and <p, have different eigenvalues for S*, this 
classification results in a considerable simplification of the secular equa- 
tion. Further simplification is possible if the <p’ s are combined into 
linear combinations which are eigenfunctions of S 2 as well as of S*. 

Let us imagine the n atoms to be divided into pairs (a, b ) (c, d), etc., 
with each pair at a great distance from all the other pairs. The system 
will be most stable if there is a bond between a and 6, one between 
c and d, etc. According to our previous discussion, a and b will have a 
stable bond between them only if the spins of the corresponding electrons 
are paired; the same condition holds for each of the other pairs. Such a 
distribution of spins corresponds to the condition S*<p = 0. Even if the 
atoms are not separated, it would seem reasonable to assume that the 
most stable configuration would be that corresponding to the maxi- 
mum number of bonds. We will therefore focus our attention on the 
determinants containing only the (p’s which have the eigenvalue for the 
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operator S, equal to zero; that is, in the six-electron case for example, 
we assume that the ground state of the molecule is given by one of the 
roots of the twenty-row secular determinant. The eigenfunctions <p 
are not in general eigenfunctions of S 2 , but we can form linear combina- 
tions of the <p’ s which are. Rather than treat the general case, let us 
consider the system of four electrons. The eigenfunctions <p which have 
the eigenvalue zero for S* are 

abed 

<P 1 a a P p 

V2 a P a P 

<PZ p a a P 

<Pi a P p a 

<Ph P « P ol 

P P a a 

We will now form a linear combination of the <p’ s which corresponds to a 
bond between a and b and one between c and d. This requires a and b 
to have opposite spins, so that we are limited to the functions <p 2 , <*>3, 
<P4 f and <ps. The combination will therefore be of the form 

1 Pab , cd = (*2<P2 + ds<P3 + &4<P4 + <*5<P5 13*2 

If we interchange the spins on a and 6, the function yp ah , c a must change 
sign, since the spin function associated with a stable bond is antisym- 
metric in the electrons. We therefore obtain 

'I'ab, cd = “"02^3 ~ ®4^5 «5^4 13*3 

Performing the same operation with the spins on c and d, we have 

&ab, cd — —a 2 <P 4 — a S (p 5 — 0 4 ^ 2 — o 5 ^> 3 13*4 

The above equations are consistent only if o 5 = o 2 , 03 = 04= —02. 
We have, therefore, for the (unnormalized) function representing the 
two bonds 0-6, c-d, the bond eigenfunction 

tab, cd = <P2 — <P3 ~ ^4 + <P5 13*5 

In a similar maimer, we find the bond eigenfunctions 

Tpad, be = <P1 <P2 ~ <P5 + <PQ 13*6 

tac, bd = <Pl <P3 ~ <P4 + ^6 13*7 

We note that these bond eigenfunctions can be written as 

6 

fat, ki = Z hi(n) hi(n)<Pn 
n-l 


13*8 
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where 6^(71) is 1 if i has the spin a and j has the spin /3; 5#(n) is — 1 if 
i has the spin p and j the spin a ; 6,7 (n) = 0 if i and j have the same spin. 
This expression can readily be generalized. 

If we want the bond eigenfunction corresponding to a bond between 
a and b, but none between c and d, we proceed in an analogous manner. 
Only <p 2 y (pz y <P 4 , <pb can be used, as they are the only functions in which 
a and b have opposite spins. This function must be antisymmetric 
with respect to exchange of the spins on a and b but symmetric with 
respect to the exchange of the spins on c and d. We readily find the 
appropriate bond eigenfunction to be 

6 

tab = <p2 ““ <P3 + <P4 ~ <P5 = 2Z &abW<p n 13*9 

n= 1 

The other one-bond eigenfunctions are 

tbc = <Pl ~ <P2 + <£>5 — ^6 
ted = <P2 + <P3 “ ~ <P5 

\ Pad = <Pl + <P2 — <P5 13*10 

tac = <P1 ~ (PS + (P4 ~ <P* 
tbd = <Pl + ^3 "" ““ ^6 

The bond eigenfunction corresponding to no bonds is obviously 

^ + ^2 + ^3 + ^4 + ^5 + ^6 13*11 

This eigenfunction is symmetric with respect to exchange of the spins 
on any pair. 

Not all the above bond eigenfunctions are independent. As our 
independent set we take the bond eigenfunctions tab, cd> tad, bo tab, 
tbe, ted , t • The remaining bond eigenfunctions can be expressed in 
terms of these as 

tac, bd = tab, cd + tad, 6c 
tad = tab + tbc + ted 

10*12 

'Pa e = 'Pab + 'Pbe 
'Pbd = 'Pbc + 'Pcd 

We will now show that the bond eigenfunctions derived above are eigen- 
functions of S 2 , and further, that bond eigenfunctions corresponding to 
different numbers of bonds have different eigenvalues for S 2 . Accord- 
ing to equation 9-3, thespinoperators S*, S„ and the spin eigenfunctions. 
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a and p obey the relations 
(S*i + tSyi)a(l) = 0 

(S,i + iS y i)P(l) - ^;a(l) 

(S xl -iS„i)«(l) -^0(1) 

(Sai - iS v iMD = 0 

where S x i is the operator for the x component of the spin of electron (1), 
etc. Denoting the <p eigenfunctions by the abbreviated notation 
(apa j8), etc., we have 

h 

(S* + iSy) (aapp) = — { (aaaP) + (aapa ) } 

h 

(Sa + iS y ) (apaP) = ~ { (aaaP) + (apaa ) ) 

(Sa + iS y )(Paap) = — { (aaaP) + (pacta) J 
(Sa + iS y )(apPa) = — { ( aapa ) + (aPaa ) } 

(S* + iSy)(PaPa ) = — { ( aaPa ) + (Jpaaa) } 

(Sa + iS y )(Ppaa) = — { ( aPaa ) + (paaa ) } 

(Sa - iSy) (aapp) = ^ { (pap p) + ( appp )} 13-14 

(S x - iS v )(apap) - { (ppap) + (a»)] 

(Sa - iS v )(paap) = ~{(ppap) + (papp)} 

(Sa — iS y )(apPa) = ^ { (PPPa) + (appp)} 

(Sa - iSy)(papa) = £-{(pppa) + (papp)} 

(Sa - iSy)(ppaa) = { (PPPa) + (PP*P)\ 


(S x + is„) = 2KS xn + is yn ) 

n 

13-13 

(Sa - t'S„) = £(San - iS„„) 
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Operating on the bond eigenfunctions we therefore obtain 

(S* + iSytyab, cd = 0 
(S x + iSyWad, be " 0 

Since 

s 2 = (s. - ts„)(s, + is u ) + A s, + s 2 


and since the eigenvalues of the bond eigenfunctions for the operator S* 
are zero, we see that the bond eigenfunctions representing two bonds are 
eigenfunctions of S 2 with the eigenvalue zero. 

Operating on fob, we obtain 


(S* + lS v )fob = ~ 


(aaafi) + (a&aa) — (aaafi) — ( (Saaa ) 

.+ (aaj 3a) -f- ( afiaa ) — (aa/3a) — (fiaaa) 


1346 


= 2 — { (a$aa) - (jffaaa) } 


(S# — " (S, + y)^ab — 2 


4tt 2 
= 2 


(Pfiaa) + (aft fiat) + (afiafi) 

— (pfiaa) — (fiafia) — (paafi) 

h 2 , , ) o ^ 

^2 1^2 “ <P3 + <P4 “ <P5 f = 2 


is therefore an eigenfunction of S 2 with the eigenvalue 1 (1 + 1) — £• 

47r 

In the same way we find that fo c and are eigenfunctions of 
9 h 2 

S 2 with the eigenvalues 1(1 + 1) —z; and that \p is an eigenfunction 

4ir 

9 h 2 

of S z with the eigenvalue 2(2 + 1) 7-5- Since matrix elements be- 

47T 

tween functions with different values of their eigenvalue for S 2 vanish, 
the six-row secular determinant for the four-electron problem is thus 
reduced to one three-row, one two-row, and one one-row determinant. 
The energy of the ground state of the system should be given by one of 
the roots of the two-row determinant, since the bond eigenfunctions 
involved in this secular determinant correspond to the maximum number 
of bonds. This is further confirmed by the observation that the ground 
state of most molecules is non-paramagnetic, indicating zero spin. 

The above results are readily generalized for more than four electrons. 
To summarize the procedure when we are interested in the ground state 
of a complex molecule, we first form the zero-order eigenfunctions <p in 
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determinantal form. These functions are eigenfunctions of S,; we 
consider only those for which = 0. From these functions <p we 
form the bond eigenfunctions by the rule 

hi, kl.mn ... = hi{v) S mn (v) 13-17 

P 

Bond eigenfunctions corresponding to different numbers of bonds have 
different eigenvalues for S 2 ; we consider only those corresponding to the 
maximum number of bonds, which have the eigenvalue zero for S 2 . 
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a b 
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Fig. 13*1. Bond eigenfunctions for four electrons. 


The energy of the ground state is then given by the solution of the 
secular equation | Hy — SijE\ = 0. (In the above discussion, we 
implicitly assumed the number of electrons to be even. For an odd 
number of electrons, the problem is first solved for the next higher even 
number, and then all integrals in the secular equation which involve the 
extra electron are placed equal to zero.) In the general case, no further 
reduction of the secular determinant can be made. If the molecule 
possesses external symmetry, a further reduction is possible, as will be 
shown later. 

The possible bond eigenfunctions for the four-electron problem and the 
relations between them are illustrated graphically in Figure 13*1 (from 
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equations 13-12). We note that a kind of vector addition law holds for 
the bond eigenfunctions. Usually we are interested only in bond eigen- 
functions corresponding to the maximum number of bonds. Figure 13*1 
tells us how to determine the number of independent bond eigenfunctions 
for any number of electrons. We arrange the symbols for the orbitals 
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Bond eigenfunctions for six electrons. 

in a circle, which, of course, need have no correlation with the actual 
structure of the molecule. We then draw the bonds in all possible ways. 
Some of these will involve bonds which cross, such as fac, bd» Any two 
bonds which cross can be uncrossed by the application of equations of 
the type 


a b 

f 


e d 

XX 


Fig. 13-2. 


tac, bd = tab, ed + tad, bo 


1318 
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This procedure can be continued until the original bond eigenfunction 
has been expanded into a linear combination of bond eigenfunctions 
which do not involve crossed bonds. Conversely, any eigenfunction 
can be built up from eigenfunctions which do not involve crossed bonds. 
We thus have the theorem due to Rumer: 1 arrange the orbitals in a 
circle, draw all the structures which contain the maximum number of 
bonds, but, draw only those which contain no crossed bonds. The 
eigenfunctions corresponding to these structures are linearly inde- 
pendent, and, as all other bond eigenfunctions representing the maxi- 
mum number of bonds can be expressed in terms of them, they form a 
complete set. 

In order to form the bond eigenfunctions with the eigenvalue of S, 
equal to zero, but representing less than the maximum possible number 
of bonds, we proceed in an analogous manner, including only those bond 
structures which are not derivable from others by the rule of vector 
addition. For six electrons, for example, we have the bond eigenfunc- 
tions as given in Figure 13*2. There is a total of twenty, as this is the 
number of <p’s with the eigenvalue zero for the operator S z . The deter- 
mination of the energy of the ground state for a six-electron problem 
would thus involve the solution of a five-row determinant. 

13b. Evaluation of the Integrals. In order to complete the solution 
of the problem, we need the values of the integrals of the type 


Hab = f tints dr and Sab = J titB dr 


Since the t’ s are expressed as linear combinations of the <p s, we will first 


focus our attention on the integrals of the type 



ing to the four-electron case, we consider the eigenfunctions 


Retum- 


^ Z ( - im 1 { (o«) 1 (bah (c/3) 3 (d/3) 4 } 


= ~Z(-l) v 'K{(aah(bl3h(c«hm*} 


The integral J <p*H<p 2 dr is therefore 

H 12 = |j/| [E(-l)’'i > i{(aa)i(&«)2(c/S) 3 (d/3) 4 }]* 

niz(-iy'p*A(ao'hm 2 (c«hmi}]dT 1319 

r' 

1 G. Rumer, Gottingen Nachr., 1932, 377. 
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If the operator PI is multiplied by another permutation operator, the 
result will be unchanged, since the integral already contains all possible 
terms. As in dealing with the atom, we therefore multiply by the 
inverse operator ( — lJ'LPj] -1 , obtaining for the first summation 

ZPliPlr 1 1 (aahibah (c/3)s W)* } = 4! { (aa) 1 (6^) 2 (^3(^)4} 

9 

since there are 41 identical terms in this summation. The integral is 
thus reduced to 

#12 = f i(a<x)i(ba) 2 (cp) a (dp) 4 } 

ni'L(-ir"PA(aa) 1 m2(ca) 3 mi}} dr 13-20 

P H 

Because of the orthogonality of the spin functions, all the terms in 13-20 
will vanish except those for which the spins match identically. Inte- 
grating over the spins, we therefore have 

#12 = - (a&cd|H|ac&d) + (abcd\H.\acdb) 

+ (a&cd]H|cabd) - (abcd\K\cadb) ' 2 

where 

(abcd\H\acbd) = J* (afaczd^Hfa^bad^ dr, etc. 

If we now assume that the one-electron functions a, b , etc., are mutually 
orthogonal (or are willing to set all integrals involving multiple exchanges 
of electrons equal to zero), all the above integrals with the exception of 
the first are equal to zero; this integral will be further abbreviated to 
(6c). We thus have the result 

Hi 2 - - (6c) 13*22 

This result is perfectly general: The matrix element Hij between two 
different <p functions is zero unless the functions differ only in the spins of 
two orbitals; then it is the negative of the corresponding exchange integral . 
For the integral Hu we obtain the result 

#11 = / { (aa)i(ba) 2 (c/?) 3 (d(3)* ! #[ ( - 1 { (aa) ! (ba) 2 ( C 0) 3 (d/?) 4 } ] dr 

=* (a6cd|H|a6cd) — (a6cd|H|a6dc) — (a6cd|H|6acd) 

- Q - («*) - (a6) 13-23 

That is : The matrix element Hy between a <p function and itself is the cour 
lombic integral Q minus the sum of all exchange integrals between orbitals 
having the same spin. 
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Continuing with the four-electron problem, and denoting <p a b, cd by 
\f/ A and tad, be by fa, we require the matrix elements H A a, Hbb, Hab- 
In terms of the integrals # 12 , etc., we obtain from 13-5 and 13-6 the 
results 

Haa — H 22 + #33 + #44 + #55 

+ 2 (.ff 2 5 + #34 — #23 ~ #24 ~ #35 ~ #45) 

Hbb = Hu + #22 + #55 + #66 

+ 2( — #12 - #15 + #18 - #28 + #25 - #56) 

HaB = #12 — #13 ~ #14 + #15 ~ #22 + #23 + #24 — 2#26 

+ #35 + #45 — #55 + #26 ~ #36 — #46 + #56 

The necessary matrix elements #,-/ are readily found by application of 
the rules given above ; we have 

#n « Q - ( ab ) - (cd) Hu = - (be) # 25 = 0 

#22 = Q — (ac) — (bd) #13 = — (ac) # 26 = — (ad) 

#33 = Q - (ad) - (be) Hu = - (bd) #34 = o 

#44 — Q ~ (ad) — (be) H 15 = — (ad) #35 = — (cd) 

#55 - Q - (ac) - (6d) #16 = 0 #38 - - (bd) 

Hoe = Q ~ (ab) — (cd) #23 = — (ab) H 45 = — (ab) 

#24 = — (cd) #46 = — (ac) 

#56 = ~(bc) 

Therefore 

Haa — 4Q — 2{ (ac) + (ad) + (6c) + (6d) } + 4{ (a6) + (cd) } 

Hbb ~ 4Q — 2{ (aft) + (ac) + (cd) + (6d) } + 4{ (ad) + (6c) } 13-24 

Hab = — 2Q + 4{ (ac) -f- (6d) } — 2{ (a6) + (ad) + (be) + (cd) } 

Since J* <p*<pj dr = 5,/, we have the results Saa - Sbb = 4; Sab = — 2. 

In general, iSUb will be equal to the coefficient of Q in Hab- 
The above procedure, although straightforward, becomes rather 
tedious for more than four electrons. By an extension of arguments of 
the above type, the following general formulation can be made. 

To find the matrix element of H between two bond eigenfunctions, 
say (1) &»6, cd , «/. gh and (2) Vvc, bd, eg, /a, we note that the two electrons 
in any bond must have opposite spins. If we let the spin of a be a, that 
of ft must be /3 from (1); according to (2) the spin of c must be 0 and 
that of d must be a. The spin of e can now be chosen arbitrarily; for a 
given choice of the spin of e the remaining spins are fixed. We say that 
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a, b, c, d form a cycle and e, /, g , h another. Let us assign the spin a to e . 
This situation is denoted by 



where the orbitals associated with a’s are written above the line and 
those associated with p’s are written below the line. The matrix ele- 
ment of H for the bond eigenfunctions (1) and (2) is then, where x is 
the number of cycles, and v is the number of interchanges of orbitals 
which are assigned different spins in the diagram in \p 2 required to 
make it equal to 

#12 3=5 ( — 1)*'2 ®{Q + f[£ (single exchange integrals between 
orbitals in the same cycle with opposite spins) — (single 
exchange integrals between orbitals in the same cycle with 
the same spin)] — single exchange integrals)} 13-25 

Equation 13-25 is perfectly general; the proof goes as follows. Let us 
consider the matrix component of H between the two functions 

== tab, cd, ef , oh ^ t B = tac, be, dh. So 


for which the diagram is 



This integral will be expressed as a sum 


of integrals involving <p functions, of the type f nEvjdT. 


There are 


three possibilities: 

1. <piz=<pj; according to 13-17, the sign of the integral is (—1)*. 
This gives a contribution to Hab, according to equation 13-22, equal to 


(-1)’{2(Q - E[(ae)-type])} 

the factor 2 arising from the fact that either a or ft could be assigned to a . 

2. The two <p functions differ in the spins assigned to, say, a and 6. 
This integral is multiplied by — 1, relative to the coefficient of Q, as 
can be seen from 13-17, and therefore, according to 13-22, gives a con- 
tribution + (ab). Since there are two such integrals, and since the 
spins of any pair of this type could have been interchanged, the total 
contribution to Hab arising from integrals of this type is 

(-l)”{+2£[(a&)-type]} 

3. The two <f> functions differ in the spins assigned to, say, a and d. 
This integral has the coefficient +1, relative -to the coefficient of Q, so 
that the total contribution to Hab arising from integrals of this type is 

(-l)M-2£[(od)-type]} 
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and the integral II ab is 

Hab = (— l)” 2{Q + £[(a6)-type] - 2£[ (ad) -type]} 
in agreement with equation 13-25. 

For the functions = tab, a, ,/, 0 h and = ^ ac , w, <•„, /* we have 

. Again there are three cases. 

1. The two ip’s are identical. There are then four different ways of 
assigning spins to a and e : 


the diagram 


.(!)© 


a ~ a, e ~ a, gives (-l/fQ — £[(ad)-type] - £[(ae)-type]} 
a~a, e~/3, gives (-1) , [Q - £[(ad)-type] - £[(a/)-type]} 
a ~ /3, e ~ a, gives (-l) r {Q - £[(ad)-type] - £ [(a/) -type]} 
a ~ 0, e ~ /3, gives (-1) , '[0 - £[(ad)-type] - £[(ae)-type]} 

Cases 2 and 3 are identical with the similar cases above, except that 
the numerical factor is 4. Since it is impossible to change the spin on 
only one orbital in a given cycle, there are no integrals of this type in 
which the two functions differ in the spins of a and e, etc. We therefore 
have the result 

Ecu = (— 1)'4{Q - 2£[(ad)-type] + £[(a6)-type] 

— i£[(ae)- and (a/)-types] 


which is equivalent to the general expression 13-25. The proof can 
readily be extended to more general cases. 

Returning to the four-electron case, for II aa we have the diagram 

The matrix component of H is therefore 



b \d) 


Haa = 2 2 {Q + f [(ab) + ( cd )] 

- ?[(ab) + (ac) + (ad) + (be) + ( bd ) + (cd)] 

= 4 Q- 2 [(ac) + (ad) + (be) + (bd)] + 4 [(ab) + (cd)] 
in agreement with the previous calculation. For Hab we have the 
di^am(g) , so that 

Hab = (-1)2 [Q + f [(ab) + (ad) + (be) + (cd)] - £[(ac) + (bd)] 

— 2 [(a&) + (ac) + (ad) + (be) + (bd) ■+■ (cd)]} 

«= -2Q + 4[(ac) + (6d)] - 2[(a6) + (ad) + (6c) + (cd)] 

13c. The Two-Electron Problem. Although the two-electron prob- 
lem has previously been treated in detail, it is of interest to consider at 
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this point the results obtained by the above method. Denoting the 
orbitals by a and b, there are four possible <p functions: 

a b 

<fii a 0 

<P 2 P at 

<P 3 a a 

<P4 0 0 

Of these four functions, <p\ and <p-z have the eigenvalue zero for S*. The 
linear combination having the eigenvalue zero for S 2 is the bond eigen- 
function 

'pA = Pab = <P1 — <P2 

The energy of the ground state is therefore given by the solution of the 
one-row secular determinant 

Haa — SaaE = 0 

According to the rules developed above, 

Haa = 2 [Q + (ab)), Saa = 2 
so that the energy is 

E = Q + (ab) 13-26 

Comparing this with the result given by equation 12-7, we see that they 

become identical if the overlap integral <S — J PaPb dr in 12-7 is placed 

equal to zero, as it has been assumed to be in the present method. 
Although this discrepancy may appear to be quite serious, in actual 
practice a compensation is made, as will become evident later. 

13d. The Four-Electron Problem. The matrix elements have 
already been determined. Inserting these in the secular determinant 

Haa - SaaE Hab - S AB E 

= 0 

Hab ~ SabE Hbb ~ S BB E 

we obtain 

t(«i+«2) _ W — f (ai+a 2 +0i+02 — Ti - 72)+%W 13 26 

— ■f-(«l+«2+/3l+^2 — Y2)+§J^ f-(Pi+02) — W 

where the common factor 4 has been removed from each term, {ab), 
(cd), (ad), (be), (ac), ( bd ) have been replaced by aj, a 2 , 0i, 02, Yi, 72, 
respectively, and 

W = E — Q + £(«i + at + 0i + 02 + 7i + Y 2 ) 
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Writing « = <*i + a 2 , /? = ft + A 2 , 7 = Yi + 72 , 13 26 reduces to 
TP 2 — (a + d + y)W — -J(a + /S — y ) 2 -f- 3a|8 = 0 

so that 

w = <i±]L±l ± IV (a + 0 + y) 2 + 3(a + 0 - 7 ) 2 - 12a/3 
or 

E = Q =fc Va 2 + 0 2 + y 2 — a& — ay — 13-27 

Of these two solutions the one with the negative sign represents the more 
stable state; this may be written as 


E = Q - V*{ '(«'- W 2 + 09 - Y) 2 + (Y - «) 2 } 13-28 


a result first obtained by London. 2 

If we now remove the electrons c and d to infinity, equation 13-28 
reduces to 

Eab = Qab + <*1 13-29 


(the sign of <*i is taken to be + since a x is a negative quantity; equa- 
tion 13-29 therefore represents the lowest state of the system), which is 
identical with 13-26, as of course it should be. This result forms the 
basis for the “ semi-empirical ” method of treating complex molecules. 
We assume that the energy of any electron pair bond a-b can be repre- 
sented by an equation of the form E a b - Qab + ocab. The energy 
of this bond is obtained from the experimental data on some simple 
system, usually the data on the corresponding diatomic molecule. 


Some assumed ratio for 


Qab 


<*ab 


being taken, each of these quantities can be 


determined independently. It is then further assumed that the quanti- 
ties cti, etc., which appear in the energy equation for a many-electron 
system, are identical with the corresponding quantities for the two- 
electron system, and that Q for the many-electron system is equal to 

YfQa, where Qy is the coulombic energy for the two-electron system 
v 

i — j. For example, if we had the problem of four atoms A, B, C, D 
with the corresponding orbitals a, b, c, d, the various terms in 13-28 are 
obtained as follows. We draw the diagram illustrating the various 
energy terms (Figure 13-3). From experimental data on the diatomic 
molecules AB, CD, AC, BD, CB, AD we determine the energies E ab , 
Ed, E ac , Ebd, E eb , Ead . These energies may be expressed as functions 


1 F. London, Z. Eleletrochem., 86, 662 (1929). 
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of the corresponding interatomic distances by means of an empirical 
equation such as the Morse function (equation 14-31). We now assume 

Qab 

some value for the ratios — , etc., and thus calculate the individual 

quantities. These values of «i, etc., as obtained from the diatomic 
molecules, are now used directly in 13*28. Q in this equation is assumed 
to be given by 

Q = Qab + Qed + Qao + Qbd + Qcb + Qad 

It will be observed that this procedure is actually an extrapolation of 
the binding energy between atoms from the diatomic to the polyatomic 
problem. Some such extrapolation is necessary if any treatment of poly- 
atomic molecules is to be made, since actual evaluation of the integrals, 
even if a satisfactory set of zero- 
order functions were known, is 
completely out of the question. 

Since the above method of treat- 
ment involves the same type of 
approximation for both the dia- 
tomic and the polyatomic prob- 
lem, we would expect the defects 
arising from these approxima- 
tions to be at least partially 
compensated for by the method 
of determining the integrals from Fig * 13 ' 3 - Coulombic and exchange en- 
, i j , , , ,, . , orgies for a four-atom system, 

experimental data. Actually it 

must be recognized that this is an empirical method, and its validity 
must be determined largely by the results obtained. 

In determining Q a b and a x from E a &, it is necessary to assume some 
definite ratio for the two integrals. In the Heitler-London treatment 
of the hydrogen molecule, where the corresponding integrals can be 
evaluated, the total binding energy is about 10-15 per cent coulombic. 
In actual practice satisfactory results are usually obtained if the coulom- 
bic energy is assumed to be about 14 per cent of the total binding 
energy for a given electron pair. 

We may summarize the situation with regard to quantum-mechanical 
calculations involving polyatomic molecules as follows. The correct 
wave function for a system of n electrons would be a complicated func- 
tion of the 3n coordinates. Judging from the results on H. 2 > a satis- 
factory wave function should contain the interelectron distances r<, 
explicitly. However, with such a wave function, the n-electron prob- 
lem becomes completely intractable; we are forced to make drastic 
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simplifications. The first approximation that is made is to assume that 
the eigenfunction for the system can be written as a product of one- 
electron functions. Even in the simple case of H 2 , these wave functions 
will not give very accurate results. Using these functions as a basis, 
we carry out a first-order perturbation calculation in which the further 
approximation that the one-electron functions are orthogonal is made. 
The energy of the system is then obtained as a function of certain cou- 
lombic and exchange integrals. These particular integrals arise from 
the method of writing the zero-order wave function and might not appear 
in this particular form in a more accurate formulation of the problem. 
These integrals are in general evaluated, not by direct calculation, but 
from experimental data on simple systems. In spite of the approxima- 
tions, these calculations usually give results of considerable value in the 
interpretation of chemical phenomena. 

13e. The Concept of Resonance. We have seen that, for a system 
involving many electrons, it is possible to write a number of bond eigen- 
functions representing various ways of pairing the electrons to form 
two-electron bonds; the energy of the system is then given by the 
solution of a secular determinant of high degree. In many cases only 
one of these bond eigenfunctions will be of importance in determining 
the energy of the ground state of the molecule; we say then that the 
electrons are localized in particular bonds. If we assume that the ground 
state of a given molecule can be represented with sufficient accuracy 
by one bond eigenfunction £ 4 , the energy of the ground state is given by 
Haa 

E — -r — . From equation 13*25, we have 
Saa 

E = Q + H (exchange integrals between bonded orbitals) 

— (exchange integrals between non-bonded orbitals) 13*30 

This equation has been used in section 12c in the discussion of the water 
molecule, where it was assumed that the bonds could be drawn in a 
unique way. In many molecules there does not appear to be any single 
way of drawing the bonds which is a better representation of the actual 
state of the molecule than other possible ways of drawing the bonds; in 
this event the complete problem must be solved by the above methods. 
We then say that the molecule “ resonates ” among the various states 
represented by the individual bond eigenfunctions; the difference in 
energy between the actual molecule and the energy of the most stable 
bond eigenfunction is called the “ resonance energy.” This concept of 
resonance has been applied, especially by Pauling , 8 to the elucidation of 

1 See L. Pauling, The Nature of the Chemical Bond, Cornell University Press, 1940. 
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many problems in molecular structure. As an example of this concept 
and of the methods derived in this chapter, we calculate the resonance 
energy of benzene. 

13f. The Resonance Energy of Benzene . 4 Experimental evidence 
shows that benzene is a planar molecule, with the carbon atoms at the 
comers of a regular hexagon, and with all C — C — C and C — C — H 
angles equal to 120°. Taking the plane of the molecule to be the xy 
plane, we can form three carbon valence bonds with bond angles equal 
to 120° from the proper linear combination of the s, p x , and p v carbon 
orbitals. This leaves the six p z orbitals, one on each carbon atom. The 
valence structure is thus similar to that of ethylene. In benzene, how- 
ever, there is no unique way of pairing the spins of the p z electrons to 
form 7T bonds. We shall not attempt to solve the benzene molecule 
completely, but shall assume the s, p x , and p y carbon electrons and the 
hydrogen electrons to be localized in a bonds, and shall limit ourselves 
to the calculation of the binding energy arising from the interaction of 
the p z electrons. This is a six-electron problem; if we denote the p z 
orbitals on the various carbon atoms by a, 6, c, d, e,f, the five independent 
bond eigenfunctions corresponding to the maximum number of bonds are 



h \ % h ^ 


We note that, owing to the symmetry of benzene, the following rela- 
tions hold between the matrix elements : 


Haa = HsBy Hcc — Hdd = Hee 
Hac = Had = Hae = Hbc — Hbd = Hbe 
Hcd — H be = Hce 


We need calculate only the matrix elements Haa, 
which is readily done by use of equation 13-25. 

diagram 



so that 


Hab, Hac , Hcd , Hcc, 
For Haa we have the 


Haa = 2 3 {Q + f[(oi>) + (cd) + (ef)) — f £ (all exchange integrals) } 

= 2 3 {Q + |[(o6) + (cd) + (ef)] 

- UW + (be) + (cd) + (de) + (ef) + (of)) 

- iKac) + (bd) + (ce) + (df) + (ae) + (bf)] 

- %((ad) + (be) + (cf)]} 

Again, owing to the symmetry of benzene, there are only three different 


4 L. Pauling and G. W. Wheland, J . Ch#m. Phys 1, 362 (1933). 
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exchange integrals: the type (ab), the type (oc), and the type (ad). 
We denote these types by a, f), y, respectively, so that 

Haa - 2 8 {Q + fa - 30 - fy} 

Since exchange integrals will decrease rather rapidly as the distance 
between the atoms increases, we now make the approximation that 0 
and y can be neglected in comparison to a } and write 

H AA =2 3 {Q + %a} 


In the same way we find for the other necessary matrix elements the 
values 

Hoc = 2 3 {Q} 

H A b = 2 [Q + 6a} 

H AC - -2 2 {Q + 3a} 

Hcd = 2 {Q + 6a} 

The secular determinant, after the common factor 2® is removed, 
is therefore 

(* + v) (| + y) “(§ + *) 

(f + ») (* + ») -(f + ») 

~(l +y ) ~(l +y ) 

"d +! ') ~(l +y ) (!+*) 

where x — Q — E and y = |-a. After a slight manipulation, this deter- 
minant reduces to 



0 

0 

0 

0 


0 

(f« + y) 

1 

'wTh' 

+ 

0 

0 


0 

1 

+ 

G i+ *) 

0 

0 

= 0 

0 

0 

0 

(1® - %y) 

0 


0 

0 

0 

0 

(f* -2 y) 



(!+») 

-<N 

(M 

+ 

^ 1 Cj, 

I 

d +! ') 

(H 

X 

(H 


X 
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The roots of the secular equation are therefore 

x ** 0, %y f %y, -f (1 + V^13 )y, f (1 — VH )y 
so that the energy levels are 
E = Q 
E = Q — 2a 
E — Q — 2a 

a is a negative quantity, so that the energy of the ground state is 
E = Q - (1 - VlS)a = Q + 2.61a 
In determining the coefficients in the expression 


E = Q - (1 + Vl3)« 

E = Q - (1 - V / 13) a 


yj/ = Ca^A + Cb^B + C(*pc + Cd^D + Ce^E 


for the wave function of the ground state of benzene, we note that, 
owing to the symmetry of the problem, Ca « cb, cq — c& — (It is 
not necessary to make this assumption; the solution of the five simul- 
taneous equations will give the same result.) From the first equation 
in the set which determines the coefficients, we thus obtain the ratio 


Cc 5 V 13 - 17 
CA 24-6 Vl3 


» 0.434 


so that the unnormalized wave function for the ground state of benzene is 
'P = 'Pa + Pb + 0.434(^c + Pd + Pe) 


Had we assumed that the ground state of the benzene molecule was 
represented by one of the KekulS structures, say Pa, the calculated 
energy would have been E '= Q + 1.50a. According to the above 
calculation, the actual benzene molecule is more stable than this hypo- 
thetical molecule by the amount 1.11a. The resonance energy of ben- 
zene is therefore 1.11a. Had we based our calculation on the two 
Kekul6 structures alone, we would have obtained the secular determinant 



with the roots x = 0, x = —■§?/, corresponding to the energy values 
E = Q, E = Q + 2.40 a. The resonance energy here is 2.40a — 
1.50a = 0.90a. We therefore see that the greater part of the resonance 
energy arises from resonance between the two Kekul6 structures. 
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The factorization of the secular determinant is not accidental. In 
the general six-electron problem the determinant would not be further 
reducible; however, on account of the presence of symmetry in the 
benzene molecule we may reduce the determinant by the aid of group 
theory. Benzene belongs to the symmetry group Z> 6 *; in this particu- 
lar problem the operation i gives no new results, so that we may use the 
group D 6 .' In Table 13T we present the results of the application of the 


Or—b 

\ / 


/ 


a b 


\ 


B 


TABLE 13-1 
a b a 

\ 

f C / 


\ 

c 

\\ 

D 



A 

B 

C 

D 

E 




E C 2 Cf C3 Ct CeCiia) C 2 (b) C 2 (c) C 2 (ab) 


A B A A B B 
B A B B A A 
C C E D D E 
D D C E E C 
E E D C C D 
5 3 2 2 0 0 


B B B A 

A A A B 

E D C C 

D C E E 

C E D D 

1113 


C 2 (be) C 2 (cd) 

A A 

B B 

E D 

D C 

C E 

3 3 


TABLE 13*2 


Character Table for 




E 

C 2 

2 C 3 

2 C« 

3 C 2 

3 C 2 

IT 

At 

1 

1 

1 

1 

1 

1 

IT 

At 

1 

1 

1 

1 

-1 

~1 

IT 

B 1 

1 

-1 

1 

-1 

1 

-1 

r 4 

B 2 

1 

~1 

1 

~1 

-1 

1 

IT 

Ei 

2 

2 

-1 

-1 

0 

0 

IT 

Ei 

2 

—2 

-1 

1 

0 

0 


+ 

5 

3 

2 

0 

1 

3 


operations of this group to the five bond eigenfunctions, which have been 
denoted by the letters A, B,C, D, and F rather than by fa, etc. In this 
table, Ct means a rotation by +120°, etc.; c' 2 (a) is a rotation about the 
symmetry axis through atom a; <&' (ab) is a rotation about the symmetry 
axis perpendicular to the bond a — b, etc. The character of the repre- 
sentation which has this set of bond eigenfunctions as its basis is given in 
Table 13-2, which contains the characters of the irreducible representa- 
tions of the group. x(fa was determined directly from the results given 
in Table 13- 1. r(%f-) can now be expressed in terms of the irreducible 
representations of the group as 

r«0 - 2 r x + r 4 + r 6 
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By means of equation 10*45 we now find the proper linear combinations 
of the original set which are the bases for these irreducible representa- 
tions. The various sums Ext6R)&A> etc., of interest are given in 

R 

Table 13*3. From the three combinations denoted by (1), (2), (3), we 


TABLE 13*3 


Exi (R)RA = 6 (A + B) £*4 (R)RA = 6 (A - B) 

R R 

Exi (B)BB = 6 (A + B) Ex4 (B)BB = -6(A - B) 

R R 

Exi (R)RC =4 (C + D + E) Lx4 («)«? = 0 

R R 

Zxi (R)RD = 4(C + D + E) Zxi (R)BD = 0 

R R 

Zxi(R)RE =4 (C + D + E) Ex 4 (fi)JW = 0 

R R 

Zxi(R)«a = 0 
R 

Zx6 (B)fiB - 0 

R 

I]x6(ft)fiC ^ 4C -2E -2D (1) 

R 

T,xs(R)RD = 4D -2C -2E (2) 

R 

'ZxsWRE = 4E - 2D - 2C (3) 

R 

can form the independent linear combinations (2) — (3) = 6(D — 1?) 
and (1) — (2) = 6(C — Z>), so that the linear combinations which are 
bases for irreducible representations of the group are 


'Pi = &A + i'B I 

r i 

ifo = 4*C + 'I'D + 'I'El 

'f'Z = *pA 'pB ) r 4 


\p 4 - IpC ~ 'I'D 
ts = I'D ~ 'pE 


Since matrix elements between eigenfunctions belonging to different 
irreducible representations of the group vanish identically, our problem 
is reduced by this method to the solution of two two-row and one one-row 


determinants. 

These 

are: 



#11 - 

S n E 

Hu ~ 

S n E 

(f* + 4 y) - 

(3x -f 6 y) 

#12- 

S \zE 

# 22 - 

S 22 E 

~(3x + 6 y) 

(fx + 6 y) 




(#83 ~ 

S 3 zE) = fyx = 0 


# 44 - 

BuB 

#45 ' 

- S iS E 

& 

1 

& 

1 

(-• | « + jr) 

# 46- 

S 45 E 

#55 ' 

- S 55 E 

l(-f* + y) 

(fx - 2y) 


( 1 ) 

( 2 ) 

( 3 ) 
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where the notation is the same as above. The matrix elements H 12 , 
etc., are easily found from the previously derived matrix elements 
Hab, etc. The roots of (1) are 

x = f(l + V / i 3)y, as-f(l - VlZ)y 

For the coefficients in the expression ^ = afa + 02^2 we obtain, for the 

root x = §(1 — Vl3 )y, the ratio — = 0.434, which is identical with the 

^1 

previous result. From (2) we have x = 0; from (3) we have x = 4$, 
x = 4$, so that the results are in accord with our previous findings, as of 
course they must be. Determinant (3) can be reduced further, since the 
two roots are known to be equal from the fact that the functions and 
^5 belong to a doubly degenerate representation. 

The calculated resonance energy of benzene has been found to be 1.11a, 
where a is the exchange integral between the orbitals on adjacent carbon 
atoms. It would be of little value to attempt to calculate this integral 
directly; we may estimate its value in the following manner. From 
thermochemical data, Pauling finds that the strength of the C — C bond 
is 59 kcal., that of the C=C bond is 100 kcal. From our previous dis- 
cussion of ethylene, we see that the difference between these values, 
41 kcal., should be approximately equal to the strength of the it bond 
between the p z orbitals. Since the energy of a bond is largely exchange 
energy, we would therefore estimate the value of the integral a to be 
about 40 kcal. (Since the carbon-carbon distance in benzene is greater 
than that in ethylene, this estimate should be a maximum.) We would 
therefore expect the resonance energy in benzene to be of the order of 
40 kcal. From thermochemical data, it is found that 1039 kcal. is 
required to break all the bonds in benzene, while only 1000 kcal. would 
be required to break three C — C bonds, three C=C bonds, and six C — H 
bonds; that is, the experimental resonance energy is 39 kcal. This 
agreement is perhaps better than should reasonably be expected, 
although we shall see later that this method gives internally consistent 
results for benzene and the other condensed ring systems. First, we 
wish to treat the resonance energy in benzene by the molecular orbitals 
method. i' 

13g. The Resonance Energy of Benzene — Molecular Orbitals 
Method. 5 In the molecular orbitals method, we do not attempt to 
find a set of functions with the significance of the bond eigenfunctions 
used above. Rather, we form molecular orbitals which are linear combi- 
nations of the atomic orbitals on the six carbon atoms. From the six 

6 E. Hdckel. Z. Physik, 70, 204 (1931); 72, 310 (1931); 76, 628 (1932).- 
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atomic orbitals, we can form six independent molecular orbitals. Two 
electrons can then be placed in each of the three most stable molecular 
orbitals, giving the state of lowest energy. We denote the six orbitals 
on the six carbon atoms by p a , Pb> Pc, Pd, Pc, and p/. These are con- 
sidered to be the solutions of a Hartree-type calculation, although we 
shall not specify them further. The molecular orbitals are then of the 
form 

P = C a p a + C b pb + Cepe + Cjp d + C e p e + Cfpf 


Carrying out the usual first-order perturbation calculation, we are led 
to the secular equation 


W p 0 0 0 0 
p W p 0 0 0 
o p W p 0 0 
0 o p w p 0 
0 0 0 p w p 
p 0 0 0 p w 


13-31 


where W = Q — E, Ha = Q, ff.y = p if i = j ± 1, Uij = 0 otherwise; 
that is, the pd s are assumed to be normalized and mutually orthogonal, 
and the integrals of H between non-adjacent carbon atoms are placed 
equal to zero. The roots of this equation can be shown by an algebraic 
method to be 

W = 2p cos k = 0, 1, 2, 3, 4, 5 13-32 

o 

or 

W - 2 p, p, -p, -2 p, -P, p 13-33 


We shall later obtain these roots by a different method. The integral P 
is presumably negative, representing attraction, so that the three 
lowest levels are those for which W = —2 p, —8, —p, or 


Ei = Q + 2p, E 2 = Q + P, Ez = Q + p 

If we place two electrons in each of these levels, we find the energy of the 
six p x electrons in benzene to be E = 6Q + 8 p. For one of the Kekul6 
structures, we have the corresponding secular equation 

W p 0 0 0 0 

p w 0 0 0 0 

0 0 w p 0 0 

0 0 p WO 0" 

0 0 0 0 w p 
0 0 0 0 p w 
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with the triply degenerate lowest root W = — /3. For this structure the 
energy is therefore E = 6Q + 6/3, so that the resonance energy of 
benzene, on the basis of this treatment, is 2.00/3 . It does not seem 
possible to estimate the value of /3 directly; we may test the theory as 
follows. The resonance energy of the series of compounds benzene, 
naphthalene, anthracene, and phenanthrene is calculated by this 
method. The resonance energies will be functions of the parameter /3. 
If the values obtained by comparison with experimental data agree, we 
may assume that our description of the structure of benzene is sub- 
stantially correct. The same test can be applied to the method of bond 
eigenfunctions. The results obtained are tabulated in Table 13-4. 6 



A 

TABLE 13-4 

B 

C 

D 

E 

Benzene 

39 

1.11a 

35 

2.00/3 

20 

Naphthalene 

75 

2.04a 

37 

3.68/3 

20 

Anthracene 

105 

3.09a 

34 

5.32/3 

20 

Phenanthrene 

110 

3.15a 

35 

5.45/8 

20 


A =» experimental resonance energy. 
B, D =■ calculated resonance energies. 
C y E = calculated values of a and 0. 


It is seen that both theories give consistent values for the parameters; 
the molecular orbital method seems slightly better here. In general, it 
appears that both methods give about the same results; 7 sometimes the 
method of bond eigenfunctions is slightly superior to the molecular orbi- 
tals method, as it was for H 2 . 

By means of group theory, we can find directly the molecular orbitals 
which form bases for irreducible representations of the symmetry group 
of the benzene molecule. The procedure is identical with that followed 
in finding the proper linear combinations of bond eigenfunctions. In 
place of the set of five bond eigenfunctions we use the set of six atomic 
orbitals as the basis for the reducible representation. The character of 
this representation is 

E C 2 2 C z 2 Co SC 2 3C' 2 ' 

xtt) = 6 0 0 0,2 0 

which, in terms of the irreducible representations gives 

rW - r x + r 3 + r 5 + r 6 

Proceeding as above, we find the linear combinations which are the bases 

e E. Huckel, Reference 5. L. Pauling and G. Wheland, J. Chem. Phys., 1, 362 
(1933); G. Wheland, ibid., 3, 356 (1935). 

7 G. W. Wheland, J. Chem. Phys ., 2, 474 (1934). 
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of these irreducible representations to be 

^ [+ a + * b + ic + + */} (ro 

fa = ^ {fo ~ 'Pb + — 'I'd + ~ ^/} (r 8 ) 

lfo = {2^o ~ lf-6 - 'Pc + 2^ 

1 


{2ft, — ft, - ft + 2ft 


*‘-VT 2 l *‘ 


2fa + 4*c + td ~ 2^e + ^/1 


{ 2^0 + ^6 — — 2^ d - \pe + fa) 


fa = {&* + 2^6 + fa ~ ~ 2^ e - fa] 


From these linear combinations, we can immediately write down the 
pertinent matrix elements of H. These are 

H n =Q + 2/3 i? 34 - i(Q -%) 

H 22 = Q — 2p H 55 — Q + P 1 q opr 

H 33 = Q — P H 66 = Q + (3 l66t> 

Ha = Q — p H 5 q = %(Q + p) 

The solution of the one- and two-row determinants now gives us the 
energy levels : 

E~Q + 2P (ro E — Q — p (r 5 ) 1 o oa 

E-Q-2 P (r 3 ) E~Q + p (r 6 ) l66 * 

The six electrons go into the lowest three molecular orbitals fa, $ 5 , fa. 
The molecular orbital with the lowest energy, fa, represents a charge 
distribution in which the electrons are distributed symmetrically about 
the ring. The second level, which is degenerate, has associated with it 
an unsymmetrical charge distribution. \p 5 , for example, represents a 
charge distribution in which the electron density on atoms a and d is 
four times greater than the density on any of the other atoms. This 
does not mean that the resultant electron distribution is unsymmetrical. 
If we take the normalized linear combinations 


(fa + fa) 


(fa + fa — fa — fa) 


tf'e = (fa - fa) = (fa - fa + 2^ c - fa + fa - 2 fa) 

we see that we have a symmetrical distribution of charge if both fa 
and fa are occupied by electrons. 



CHAPTER XIY 


THE PRINCIPLES OF MOLECULAR SPECTROSCOPY 

In. the preceding chapters we have discussed the energy levels and 
electronic wave functions for molecules in which the nuclei were assumed 
to be at rest. For a diatomic molecule the electronic energy levels were 
given by the solution of the equation 

{H 0 — E 0 (r)}\p — 0 14-1 

where the electronic energy E 0 (r) was a function of the internuclear 
distance r, the stable position for the nuclei being that which made 
Z?o(r) a minimum. We wish in this chapter to drop the restriction that 
the nuclei remain in fixed positions, and to consider the possible wave 
functions and energy levels for actual molecules in which the nuclei 
may move relative to one another or relative to a set of axes fixed in 
space. 

14a. Diatomic Molecules (Spin Neglected). For the discussion 
of diatomic molecules we will use the following coordinate system. 
Let x', y', and z be the axes of a rectangular coordinate system fixed in 
space, and let x, y, and z be the axes of a similar movable system with 
the same origin. The orientation of the movable system relative to the 
stationary system will be defined by the Eulerian angles 0, x, <p, as illus- 
trated in Figure 14-1. The equations giving the relation between 
x, y, z and x', y , z' are: 

x' = x(cos ip cos x — cos 6 sin <p sin x) 

— y(sin <p cos x + cos 6 cos sin x) + 2 sin 0 sin x 

y' = x(cos <p sin x + cos 6 sin ^ cos x) 14-2 

— y(sin <p sin x ~ cos 0 cos <p cos x) ~ 2 sin 6 cos x 

z' = x sin 0 sin <p -f- y sin 0 cos <p + z cos 0 

The concept of a molecule possessing electronic, vibrational, and rota- 
tional energy levels is quite familiar. Similarly, several investigations 
along the lines indicated in section 11a have shown that the complete 
wave equation for a diatomic molecule can be separated into parts corre- 
sponding to electronic, vibrational, and rotational motion, the terms 
which must be neglected in order to make this separation being in general 
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X 


very small. 1 The results of these investigations show that if the com- 
plete wave function is written as 

4> = F(x, y, z, r)R(r)U(6, x) 14-3 

then the component wave functions satisfy the equations 

(Hofo y, z, r) - E 0 (r)}F(x, y, z, r) = 0 14-4 

{dt rX r ’£) +E - BW - 0 146 

{ -L >1) + 4 - a - ft coa ,Y + 

Ism e dd \ ddj sin 2 e\dx J h 2 

U ( 0, x) = 0 14-6 

In these equations E is the total energy of the system, r is the intemu- 

clear distance, n is the reduced mass, and X is the angular momentum 

about the z axis associated with the electronic wave function F(x } y } z } r). 
R(r) and U(6, x) are the vibrational and rotational wave functions, 
respectively; we see that the rotational and electronic energies enter the 
vibrational wave equation as effective potential energies. We shall first 
investigate the possible states of the system and the symmetry proper- 
ties of these states before we consider the energy levels in detail. 

Equation 14*4 is just the equation for the electronic energy which was 
discussed in Chapter XI. The electronic wave functions F(x, y } z ) r) 
therefore have the symmetry properties of the various irreducible repre- 
sentations of the groups D^ h or 
Coov according as the nuclei are 
identical or different. The vibra- 
tional wave function R(r ) de- 
pends only on the distance 
between the two nuclei and 
therefore belongs to the totally 
symmetrical representation. The 
complete wave function will thus 
have the symmetry properties of 
the product F(x } y , z, r)U(d , x). 

In order to discuss the nature of 
the solutions of 14*6 it will be con- 
venient to consider first the wave 
equation for a symmetrical top, 



Rotating coordinate system. 


that is, a rigid body with rotational symmetry about one axis. Using 
the coordinate system of Figure 14 - 1, with the -symmetry axis of the rigid 

1 M. Bom and E. Oppenheimer, Ann. Physik , 84, 457 (1927). R. Kronig, Z . 
Physik , 46, 814; 50, 347 (1928). J. Van Vleck, Phys. Rev. t 33, 467 (1929). 
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body along the 2 axis, the wave equation 2 for the system is 


1 d_ 

sin 0 dd 


( 8il9 ») 


+ 


1 aV 


sin 2 6 die 


+ 


( cc 
si 


cos 2 6 .AN d 2 ij/ 
sin 2 6 C/ dip 2 


2 cos 6 aV , 8x 2 4 
sin 2 6 d x dip + h 2 * 


0 14-7 


where C is the moment of inertia about the symmetry axis and A is the 
moment of inertia about an axis perpendicular to the symmetry axis. 
If we now set 


* = e(8)e iKv e iMx 


we obtain the equation 
1 d_ 
sin 6 dd 


( . „ dQ\ \ M 2 , ( cos 2 6 , A\ rr2 

\ am 9 WJ “ 15 ?* + + c) K 


sin 2 6 
2 cos 6 


8t 2 A 

KM -tt-E 


. 9 « ■**«*«■ o 

sin 2 0 A 2 

For the special case in which K = 0, this equation reduces to 

M 2 8t r 2 A 


0 = 0 14-8 


1 d / . 
sin 6 dd \ S1 


• 0 dQ \ 


sin 2 d 


Me- 0 


14-9 


which is identical with the equation discussed in section 5c. The 
energy levels are therefore 

Ej, o.u = -^J(J + 1) 14-10 

and the wave functions are 

e,. o. M = pT (cos d) i4-ii 

For K 0, the solutions of 14-8 are quite complex, and we shall give 
here only those results which are significant for our discussion. The 
energy levels in this case are 


Ej, K, m — 


_ ft 2 f J(J + 1) 


8 A 


+ K‘ 


e-ai 


14-12 


where J > |x|, and the solutions 0^, Ki m have the property that 

Qj, k. r -«)-(- l) W_M e;, —k, m (0) 14-13 

Returning now to equation 14-6, we see that if we write 

U(d, x ) = e'(d)e iMx 

* F. Reiche and H. Rademacher, Z. Physik , 39, 444; 41, 463 (1927). R. Kronig and 
1. Rabi, Phya. Rev., 29, 262 (1927). D. Dennison, Rev. Modem Phya., 3 f 280 (1931). 
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we obtain the equation 

_1_ d/ . JQ'\ [ M 2 cos 2 e 2 

sin 0 dd \ n dd ) Isin 2 6 sin 2 0 


}e' 

sm 2 e h 2 1 

= 0 14-14 


This is identical with equation 14-8, provided that we replace K by X, 

A by jLtr 2 , and put -* equal to zero in equation 14-8, so that 
C 

Q r (8) — Qj , K , a i(d) 


14b. Symmetry Properties of the Wave Functions. 3 If ail the 
particles of the system are subjected to an inversion through the origin 
of coordinates, the transformation is equivalent to changing from a right- 
handed to a left-handed coordinate system. The Hamiltonian operator 
is invariant under such a transformation, so that the complete wave 
function must either remain unchanged or must change only in sign. 
This inversion at the origin is equivalent to a rotation of 180° about an 
axis perpendicular to the z axis, which we may take to be the y axis, 
followed by a reflection in the xz plane. If the transformed angles are 
denoted by the superscript t, then 9* = t — 6, tp* = 2 t — <p, x* — n + x- 
If yp* = yp, the wave function is said to be positive; if yp 1 = —yp, the wave 
function is said to be negative (the superscript t denotes the transformed 
wave function). The rotation about the y axis cannot change the elec- 
tronic wave function since it leaves the relative positions of electrons and 
nuclei unaltered; the reflection in the xz plane is equivalent to the 
operation <r„. 2 + states are unchanged by the operation <r v ; for 2~ 

states, the operation <r„ changes the sign of the wave function. For the 
2 states we thus have the result that (2 + )‘ = 2 + , (2 ~)‘ = — 2 - . 
For X 9 s 0, we have the doubly degenerate states II, A, etc. For these 
states the electronic wave functions F ± x will be of the form F±\ — 
/ d .xe ±<x ’’, where /±x is a function independent of <p. Since e ±lX(2 ’ r- '‘ ,) = 
gTix* we conc i U( j e that F+ x = F_x, FL\ = F + \. From 14-13, we see 

that 0/, x. M = (-1 _ x M gince e iM(r+x) = i)M e ai X) 

we have, for the transformed value of U(d, x), the result U j, \,m = 
(— 1 _x, Af. The complete wave function will transform in the 

same way as the product of F and U. For X = 0, we therefore have the 
following relations: 

1. If the electronic state is 2 + , then \p‘ — (— 1 ) J yp, so that a rota- 
tional state is positive for even J and negative for odd J. 

2. If the electronic state is 2“, then yp* = (— 1) J+ V, so that a rota- 
tional state is negative for even J and positive for odd J. 


* E. Wigner and E. Witmer, Z. Phystk, 51, 859 (1928). 
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When X ^ 0, we can form two linear combinations of the functions 
such that = ^1, = —^2; these combinations are 

ii - Vr. x, Af/+x® iX *’ + (~ -x, AfZ-xe-^ 

*2 = Vr. x, j/Z+x^ - (-1/- X C7/, — x, M/-xe- <Xv 

For X 5^ 0, we thus have both a positive and a negative state for each 
value of J. 

The positive or negative character of the wave functions is independent 
of the nature of the nuclei. For identical nuclei we have in addition 
the g or u property; the electronic wave function either remains the 
same or changes sign upon inversion at the origin. Since an interchange 
of the nuclei does not affect the Hamiltonian, the complete wave function 
must either remain unchanged or must change sign when the nuclei are 
interchanged. Wave functions of the type that remain unchanged are 
called symmetric in the nuclei; those that change sign are called anti- 
symmetric. Interchange of the nuclei is equivalent to an inversion of all 
particles through the origin, followed by an inversion of the electrons 
only. Under the first operation the wave functions transform according 
to their positive or negative character; under the second they transform 
according to the g or u property of the electronic wave function. We 
therefore have the following relations: 

1. For molecules in g states, positive terms are symmetrical in the 
nuclei, negative terms are antisymmetrical in the nuclei. 

2. For molecules in u states, positive terms are antisymmetrical in 
the nuclei, negative terms are symmetrical in the nuclei. In Tables 14*1 
and 14*2 we giye the symmetry properties of the various allowed states 
for diatomic molecules, 3 where the positive and negative property is 
designated by the symbols + and — , the symmetric-antisymmetric 
property by $ and a. 

14c. Selection Rules for Optical Transitions in Diatomic Molecules. 

According to the results derived in Chapter VIII, a transition between 
two states yp\ and resulting in the emission or absorption of radiation, 

is possible only if the integral j'yf'Wfo dr is different from zero (x f 

represents a/, y\ or z f ). Using the symmetry properties of the wave 
functions we can immediately derive several selection rules for transi- 
tions in diatomic molecules. Since x ' changes sign upon inversion, the 

integral V2 dr will be different from zero only if tlfc changes sign 

upon inversion. We thus have the selection rule : 

1. Positive terms combine only with negative terms. 
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Since interchange of the nuclei does not affect the electric moment 
integral for molecules with like nuclei; and since x f is unaffected by this 

interchange, the integral J tWfo dr will be different from zero only 

if the product does not change sign under this operation. This 
gives the additional selection rules for homonuclear molecules: 

2. Symmetric terms combine only with symmetric terms. 

3. Antisymmetric terms combine only with antisymmetric terms. 
We need in addition the selection rules for changes in /, X, and M. 

TABLE 14-1 

Symmetry op Rotational States for Homonuclear Diatomic Molecules 
J - 0 1 2 3 4 

2+ 4- - + - + 

8 a 8 a s 

2+ + — + — + 

a 8 a s a 

“ + — + “ 

a 8 a s a 

2„ “ + - + - 

s a 8 a 8 

n 0 +-+-+-+- 

s a sa sa 8 a 

n„ +-+-+-+- 

a 8 as a 8 as 

Ag 4 — 4 — 4 — 

sa sa sa 

A„ H — 4 — H — 

as as as 

TABLE 14-2 

Symmetry op Rotational States for Heteronuclear Diatomic Molecules 
J - 0 1 2 3 4 

2+ 4* ~ 4" ~ 4“ 

2 " - + - + - 

n 4- — 4-— 4-— +- 

A 4-- 4-- + - 

(Since the energy does not depend on M in the absence of a magnetic 
field the selection rule for AM is of little significance.) Using the trans- 
formations 14-1 for x, y , and z, we will obtain a series of integrals, each 
of which may be written as the product of two integrals, the first involv- 
ing the functions F ± \, R(r), and the coordinates x, y, z; the second 
involving the functions Uj t m(&, x)e ±tXv> and the angular coordinates 


A 

ti 
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0 , Xi <P • Since R(r ) is totally symmetric, the first of these integrals will 
be non-zero if the product of the electronic wave functions transforms 
under the operations of the group in the same manner as one of the 
coordinates. 

We consider first the transitions between two 2 states. For molecules 
with unlike nuclei, belonging to the group C*„, the possible direct prod- 
ucts of the irreducible representations are 

2 + 2 + = 2 +; 2 ~ 2 “ = 2 +; 2 + 2 ~ = 2 " 

The only coordinate which transforms like any of these products is z, 
which belongs to the representation 2+ The possible transitions are 
therefore 

2 + 2 + 2 “ *-> 2 “ 

For these transitions the electric moment is along the z axis; the band 
spectra corresponding to such transitions are known as parallel bands. 
For molecules with like nuclei, belonging to the group D^h, the possible 
direct products are 


2 + 2 + = 2 + 

2+27 = 27 

2727 = 2 + 

2727 - 2 + 

2+27 = 27 

2+27 = 27 

2+2 + = 2 + 

2 + 2 + = 2 + 

272+ = 27 

2727 = 2 + 




Again the only coordinate which transforms like a 2 state is z, which 
belongs to the representation 2+. The possible transitions are therefore 

2 + 2 + 2 " 2 ~ 

The integrals over the angular functions associated with the integrals 
over z are the same as those involved in determining the selection rules 
for the hydrogen atom, where it was shown that the integrals vanish 
unless 

A J = zfcl, AM = 0, ±1 

From Tables 14-1 and 14-2 it can readily be seen that the selection rule 
AJ = =fcl, together with the selection rules + <->■ — , s s, a a, 
give the same results for the allowed transitions between 2 states as the 
above considerations based on group theory. 

When one or more of the states has X 0, then the integrals over the 
angles give the following allowed transitions: 

x 9 y integrals 5* 0 AJ = 0, dbl; AX = dbl, AM = 0, zfcl 

(perpendicular bands) 

z integral ^ 0 AJ = 0, zfcl; AX = 0, AM = 0, zfcl 

(parallel bands) 
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These restrictions on J and X, plus the selection rules + — , s s> 

a o, are sufficient to determine the selection rules for such transitions. 
All allowed transitions for diatomic molecules 3 are listed in Table 14*3. 

We note that, for homonuclear molecules, all allowed transitions 
require a change in the electronic state; that is, pure rotation or rota- 
tion-vibration spectra are forbidden. 

TABLE 14-3 

Allowed Transitions in Diatomic Molecules 


c xv 


2 + <->2 + 

2 

2-^2- 


n <-»2+ 

n„^2j 

Rm 

n «->2~ 


Uu 

n <-> n 

n# <-> Hu 

a <-+n 

n g A w 

n w 

A A 

A g Am 


So far in this chapter we have not included the effect of electron spin. 
As mentioned in Chapter XI, molecules may possess singlet, doublet, 
triplet states, etc., according as the total spin angular momentum quan- 
tum number S is equal to 0, f , 1, • • •. For molecules, just as for atoms, 
the selection rule is AS = 0; this rule must be considered in applying 
the results given in Table 14-3. Tables 14*1 and 14*2 are correct only 
for singlet states, which are by far the most important. For a detailed 
account of the way the states are modified when S ^ 0, the reader is 
referred to a textbook on molecular spectroscopy. 4 

I4d. The Influence of Nuclear Spin. Since it is known that an 
atomic nucleus has spin angular momentum as well as mass and charge, 
we will have obtained an exact wave function for a molecule only when 
the wave functions discussed above are multiplied by a nuclear spin 
wave function <p. Designating the total wave function by yp tj we have 
\j/ t = yp<p y where ^ is the wave function previously discussed. We have 
previously stated that a wave function must be antisymmetric in elec- 
trons; the wave function must change sign when two electrons are inter- 
changed. It has been found that the total wave function must be 
chosen so that it changes sign whenever any two like particles are inter- 
changed; the behavior of ipt for the interchange of two protons or two 
neutrons is the same as for the interchange of two electrons. If, in a 
homonuclear molecule, the two* nuclei are interchanged, the total wave 
function will change sign if the number of particles (and hence the 

4 G. Herzberg, Molecular Spectra and Molecular Structure f 1, 231 , Prentice-Hall, 1939. 
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atomic weight) is odd, and will remain unchanged if the number of 
particles is even. We thus have the following rules for homonuclear 
molecules: 

1. If the atomic weight is even, the combinations and 

^(a)<p(a) are possible wave functions. 

2. If the atomic weight is odd, the combinations s)(p(a ) and 
i/(a)<p(s) are possible wave functions. 

The symbols s and a denote functions which are symmetric and anti- 
symmetric in the nuclei, respectively. Since it is known that protons 
and neutrons, as well as electrons, have spin J, the resultant nuclear spin 
is integral for even atomic weight and half-integral for odd atomic 
weight. For atoms with nuclear spin equal to zero, we can construct 
only one nuclear spin wave function ^o(l)^o(2), where ^o(l) means that 
nucleus (1) has zero spin, etc. This wave function is symmetrical in 
the nuclei. Since zero spin can occur only for even atomic weight, we 
have the following corollary to the above rules : 

3. For atoms with nuclear spin zero, only those states with wave 
functions \ p(s) are possible. 

When we construct nuclear spin wave functions as indicated below, 
we find that, if the nuclear spin is 7, then 

Number of functions <p{s) __ 7+1 _ Number of “ ortho ” states 

Number of functions cp(a) I Number of “ para ” states 


For molecules in 2 states this will result in alternating statistical weights 
for the alternating J values, giving alternating intensities of ratio 

to the lines in the rotational bands resulting from transitions 
between these states. 

We discuss first ortho- and parahydrogen. Since the proton has 
spin the possible nuclear spin wave functions are <p+u and corre- 
sponding to the spin wave functions a and 0 for an electron. We can 
therefore construct three symmetrical wave functions 

2 ) 

?+ h ( 1 V - h ( 2 ) + ^-^(1 Wh ( 2 ) 


and one antisymmetric wave function 

“ <p-hW<p+h(2) 

giving a ratio of symmetric to antisymmetric functions equal to 


3 - 


|+1 


Thp ground state of hydrogen is *2/. As hydrogen 
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belongs to case 2, we see that the function \p must be combined with 
antisymmetric nuclear spin wave functions for even values of J, and 
with symmetric nuclear spin wave functions for odd values of J . Since 
the forces acting on nuclear spins are very small, there is little possibility 
of a transition from a symmetrical nuclear state to an antisymmetrical 
one, hence hydrogen exists in two quite distinct species: orthohydrogen, 
with only odd J values allowed, and with nuclear spin statistical weight 
3; and parahydrogen, with only even J values allowed, and with nuclear 
spin statistical weight equal to 1. In hydrogen gas under ordinary con- 
ditions the two species are thus present in the ratio 3:1. A possible 
electronic transition for hydrogen is from the ground state *2* to an 
excited state 1 2j. The symmetry properties of these states are given 
in Table 14-4, with the possible transitions indicated by full and broken 
lines. We see from this table that transitions involving odd values of J 
in the ground state will be three times as intense as those involving even 
values of /. 

Deuterium has a nuclear spin of unity, so that the possible nuclear 
spin wave functions are <p+ 1 , <p 0 , v?-i. With these functions we can form 
six symmetrical combinations 

<p+ i(l)<p+i(2) <p+i(l)^>o(2) + ¥>-fi(2)^o(l) 

<Po (l)^o (2) ^ + i(l)^_.i(2) + 0+i(2)«0_i(l) 

<P-i(l)<P~i(2) <p 0 ( l)v?_i(2) +^ 0 (2)^>— i(l) 

and three antisymmetrical combinations 

<p+i(l)v>o(2) — < p + i (2)< po (1) 

^+i (2) — <p-fi(2V_i(l) 

«?o(1V-i( 2) — ^o(2V-i(l) 

TABLE 14-4 
*2+ -► l 2+ in H 2 

Excited State 


J = 0 

1 

2 

3 

4 

5 

a 

8 

•f 

a 

s 

+ 

a 

3 

<f> 8 

a 

8 

a 

8 

a 

Nuclear Spin o 
Statistical Weight 0 

1 

3 

1 

3 

1 

Ground State (*2p/ 
J = 0 

A 

/ \ 

v 

1 

A 

2 

0 

3 

0 

4 

/ 

/ 

f 

\ 

5 

\p 8 

' a 

+ 

8 

a 

8 

a 

<f> a 

8 

a 

8 

a 

8 

Nuclear Spin 1 
Statistical Weight 

3 

1 

3 

1 

3 
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Deuterium has the ground state x 2+ and belongs to case 1, so that ortho- 
deuterium has even J values, paradeuterium has odd J values; the 
ortho-para ratio is 2 : 1. For a transition such as that illustrated for 
hydrogen in Table 14-4, transitions involving even values of J in the 
ground state are twice as intense as those involving odd J values. 

The ground state of O2 6 is 3 27- O 16 has zero nuclear spin and there- 

fore belongs to case 3. In the ground state, only even J values are 
allowed. 

14e. The Vibrational and Rotational Energy Levels of Diatomic 
Molecules. The radial wave function R (r) satisfies the equation 

\sh + * ~ ~ - 0 1416 

where E is the total energy, E 0 (r) is the electronic energy, and E f (r)> 
the solution of 14*5, is, according to 14- 12 and 14*14, given by the 
expression 

E\ r ) =^p{ J ( J + l)-\*\ 14-17 


The electronic energy E 0 (r), according to the results of Chapter XI, has 
the following qualitative behavior. For r = a>, it has the value zero, 
corresponding to dissociation of the molecule. For some value r = r e , 
the equilibrium distance, it has the minimum value E 0 (r e ). For r — > 0, 
I?Q(r) — > 00 . Regardless of the exact form of E 0 (r), it can be expanded 
in a Taylor’s series about the position r = r e as 

E 0 (r) = E 0 (r e ) + (r - r e ) 


e ) 2 / d 2 E 0 (r) \ 

\ dr 2 A-, 


14-18 


Since r = r e is a position of equilibrium, 



Equation 14-16 may then be written as 


h 2 d / d\ 

,87r 2 /ir 2 dr \ dr) 


+ 6 - V (r) - 



- 0 


or 


Id/ , j -4 87 t 2 m , Tr/ 


R - 0 


14-19 
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where 

e = E — Eo(r e ) 

V(r) = E 0 (r) - E 0 (r e ) 

A = J(J + 1 ) - X 2 

The function V (r) acts in this equation as a potential energy; the energy 
origin has been shifted so that V(r) =0 for r = r e . The energy c now 


represents the energy of vibration and rotation of the molecule. If we 

now let R(r) = - S ( r ), the function S (r) satisfies the equation 
r 

8ir 2 n 


drS , i _ A 
df t+ 1 r 2 


+ • 


h 2 


(«- F(r)) S = 0 


To a first approximation, we may write 

I'M - Hr ~ r.f - hHr - r.)‘ 


14-20 


that is, the potential energy curve, to this approximation, is that of a 
harmonic oscillator. Using this value for the potential, the equation 
for S becomes 


djS 

dr 


2 + j— ^ + -jrr ( e ~ ik( r ~ r «) 2 ) 

h 2 A 


S = 0 


14-21 


Let us treat the term — 5 as a perturbation. Then, if we let 

8t h r 

x = r — r e , the equation for S is 


d 2 S 8 ir 2 n , 


14-22 


The boimdary conditions for this problem require that R = 0 for 
r = 0, r = + oo , which is equivalent to the conditions that <S = 0 for 
r = 0, r = + ao ; or >S = 0 for x = — r e , x = °o. At x — —r e , the 
potential energy is V (r) = §krj. For those states for which e -C §fcr 2 , 
the wave function will be quite small in the region x - — r t from these 
energy considerations alone. It will therefore be a valid approximation 
for small values of e to replace the boundary condition S = 0 for 
x = — r e by the condition S = 0 for r = — ». Equation 14-22 thus 
has the same form and the same boundary conditions as the equation for 
the harmonic oscillator problem ; the solutions are 
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where 


4:Tr 2 nv e 1 [k 

a " ir ’ 


and the energy levels are e„ = (n -f- \)hv„ 

The first-order correction to the energy will be 


, _ tfA 
*» ~ 0-2. 


/_. ^ (x) SM * 


87 T> «/_oo 


The term 


(a: + r e )‘ 


, may be expanded in terms of — as 


(x + r.) 2 2 

r. 


1 1 / XT? \ 

jpy’?.\ l - a z +t v + -) 


Only the first three terms in this expression being kept, the perturbation 
energy is 

e n = ^2 r*w{l-2£ + 3 - 2 } S.(*) dx 14-26 

8ir V** 7 -* l r, r.J 

The first integral is equal to unity. Using the recursion formulas for the 
harmonic oscillator (equation 4-99), we find that the second integral is 

3 

zero, the third is equal to — * (n + §), so that the first-order perturbation 

ctr; 

energy is 

£ = B' n A, B' n = B: - *'(» + \), B'. = r-y-j , 

07 T>r 0 


Of equal importance to the term in a' is the contribution to the second- 

2# f 

order perturbation energy arising from the term B e A. This gives 

U 

a contribution 

n _ K^-ll 2 + Kn+ll 2 _ K^-Kn+ll 2 14>2g 

*» *n — 1 <n ~ *n+ 1 Ap# 


Using the results 


(»l*l w + !) - > < n M n ~ 
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of equations 8-48 and 8-49, this becomes 

2B?A 2 


// 

*tl = - 


4t 2 ,M 


= —D'A 2 ; D' = 


The coefficient of A in the energy expression is Q 2 


4B' 3 

h 2 v 2 
8tt 2 h 


& 


14-29 


For the 


harmonic oscillator, is greater than ^ , so that a, is negative. For 

actual potential energy curves, the curve is flatte r in t he direction of 

greater r than in the other direction; as a result, is less than pj 

and a, is positive for actual molecules. The part of the vibration-rota- 
tion energy which is a function of A, or, as X is usually zero, of J(J + 1), 
is usually written, in analogy with the above results, as 


E, 


rot 

he 


= B n J(J + 1) - D n [J(J + l)] 2 


14-30 


where 

B n =B t - a„(n + |) B e - 
D n = D„ + & e (n + £) 

When the molecule is in the vibrationless state n = 0, its effective mo- 
ment of inertia can be found from the equation 

Bq = B t fa» 

In general, a t will be positive; therefore B 0 < B t and thus r<j > r,; 
that is, the average value of r, due to the zero-point vibrational energy, 
is greater than the distance r e corresponding to the minimum in the 
potential energy curve. For example, in HC1 the values of the constants . 
are B, =* 10.5909, a, — 0.3019, giving r„ = 1.2747 A, ro = 1.2839 A.® 
The quantities a„ D e , P, will usually be quite small in comparison to B e ; 
their exact analytical form depends upon the exact form of the potential 
energy curve. 

The above results are based upon a harmonic oscillator potential 
energy curve, which of course does not give a particularly good repre- 
sentation of the potential energy curves for actual molecules. We 
could include additional terms of 14-18 in the expression for the potential 
energy; a more satisfactory procedure is to assume some appropriate 

f Reference 4, p. 488. 
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analytical expression for the potential energy curve. Morse 6 has written 
the potential energy as 

V(r) = D e {l - 14-31 


which has the correct qualitative form; the constant D t (not to be 
confused with the constant used above) is equal to the depth of the 
potential energy curve at r- r e . Using this value for V (r), the equation 
for S(r) becomes 


d?S 

dr 2 


+ 


A 

r 2 


St 2 h 

IF 


- 5 + =rf (« - *>.U - e _o(r_r,) } 2 ) 


S = 0 14-32 


or, if 


h 2 A 

8wV* 


is to be treated as a perturbation 


fS 8tt 2 m 
dr 2 + h 2 


(e - D t { 1 - e~ a(r ~ r,> 


2 ) 5 = 0 


14-33 


If we now make the substitution x = e o(r r,) , equation 14-33 reduces to 


fS ldS 8 ttW < - D t 

dx 2 x dx a 2 A 2 \ x 2 


+ 




S = 0 


14-34 


Making the further substitution 

S(x) - e? yk(y) 

where 

V = 2/iD* x 


47T 


6 = — V2 fj^De-e) 


14-35 


we obtain as the equation satisfied by L(y) the result 

6 + 


d 2 L . (b + 1 _ \dL 
dy 2 \ y ) dy 




0 14-30 


The associated Laguerre polynomials satisfy the equation 


d 2 LlVi 


+ 


/ 2(H 
\ y 


+ D 


\dlft 1 m 

/ dy 


; ifitf = o 14-37 


Equations 14-36 and 14-37 will be identical if we set 6 + 1 = 2(1 + 1) 

and the quantity \~ V 2 pD, — ^ 

Ian ' 


equal to some integer n. This 


*P. Morse, Phye. Rev., 34, 57 (1929). 
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last step gives an equation which will determine the energy. We have 


~ < / 2ixD t - n + |J = ^ ^ V2 m(D, - *) 14-38 

from which 


= o) e (n + f) - x e c o 9 (n + %) 2 


where 




a_ fDe # 

7Tc\2fl * * 



14-39 


According to the results of 14*39 and 14*30, we should expect the vibra- 
tional-rotational energy levels to be expressible by a function of the form 


+ f ) - x e u ) e (n + \) 2 
he 

+ B n J(J + 1) - D n [J(J + l)] 2 14*40 

For example, the energy levels of HC1 (in the ground electronic state) 
can be very accurately reproduced by the equation 7 

= 2988.95 (n + J) - 51.65(n + f) 2 
he 

+[10.5909 - 0.3019 (ft- + f)] J(J + 1) - 0.0Q04[/(/ + l)] 2 

The values of the constants D e and a in the Morse curve can be deter- 
mined from the experimental values of o) e and x e u e ; however, the values 
of the dissociation energy Do = D e — \hv t as determined in this manner 
will usually be in disagreement with the values determined by more 
direct methods. 

14f. The Vibrational Spectra of Polyatomic Molecules. If we con- 
sider a molecule as a system of point masses (the atomic nuclei), then, 
according to section 2d, the kinetic and potential energies of the molecule 
can be written in the form 

3 V /r/OA 2 

T = iZ(~n , F = 14-41 

*-i \ at / *-i 

where N is the number of atoms in the molecule. Most of the simpler 
molecules will have certain elements of symmetry; they will belong to 
one of the symmetry groups of Appendix VII. Since subjecting a mole- 
cule to a symmetry operation cannot change the potential or kinetic 
energy of a molecule, the normal coordinates must transform in the 

7 Reference 4, p. 121. 
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following manner. If Qk is non-degenerate, that is, if no other A is 
equal to A*, then the symmetry operation R acting on Qk must change it 
either into itself or its negative, so that RQk — ± IQ*. If Qk is degener- 
ate, for example if Aj = A*, then the symmetry operation may change Qk 
into a linear combination of Qk and Qi. In general we would have 

ft 

RQik = Y aikQil 

l-l 

where the summation is over the /,• values of A for which A,/ = A,*. If 
S is another symmetry operation, then 

ft ft /. 

SRQik = Y aikSQu = Y Y a lk b ml Qi m 

l - 1 1-1 m— 1 

and if T is the resultant of the successive application of the two opera- 
tions R and S, T = SR, we have 

ft 

'RQik ~ Y, CmkQim 

Wl — l 

The coefficients therefore obey the relation 

Si 

Cmk ** bml&lk 

1 

We thus see that the normal coordinates form bases for irreducible repre- 
sentations of the symmetry group of the molecule in exactly the same 
way as do the eigenfunctions of H. If we form a reducible representation 
of the group based upon any arbitrary set of 3 N coordinates, and then 
find the irreducible representations of which it is composed, there will be 
as many distinct values of X (except for accidental degeneracy) as there 
are irreducible representations in the reducible representation. The 
degeneracy of a given X, that is, the number of normal coordinates which 
have this value of X as their coefficient in the potential energy expression, 
will be equal to the dimension of the corresponding irreducible represen- 
tation. Further, these normal coordinates will transform in the manner 
indicated by the matrices of the corresponding irreducible represen- 
tations. 

If we express the kinetic and potential energies in terms of an arbi- 
trary coordinate system as 

T = h'Yoii^ ~ V = IZbifllQ! 

then, according to 2-55, the values of X in equation 1441 are given by 
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the roots of the determinantal equation 

anX — 6 n ai2\ — &12 • * * — bi t zN 

- 0 

<*>3N, lX — &3AT, 1 * * • • * * <*>3N, 3 ~ &3 N, 3 N 

If, however, we use a set of coordinates (which are not necessarily nor- 
mal coordinates) that transform under the symmetry operations of the 
group to which the molecule belongs in the manner indicated by the 
matrices of the irreducible representations, then all cross products of 
the type QiQj, where Q- and Qj belong to different irreducible repre- 
sentations, will vanish; this choice of coordinates will thus greatly 
simplify the solution of equation 2-55. We shall return to this point 
briefly a little later; first we wish to derive the selection rules for optical 
transitions between the various possible vibrational states of polyatomic 
molecules. 

Using equations 1441 for the kinetic and potential energies of a mole- 
cule, the vibrational wave function is found to be 

3JV ^2 » o 2 3 N 

£ tJ? + w @ ~ * £ - 0 1442 

Making the substitutions ' 

3 N 3N 

*=n UQiY, ZEi 

•-1 t-1 

we obtain the set of 3N equations 

^ ^ (Ei - fr&Wi = 0 14-43 

each of which is the equation for a one-dimensional harmonic oscillator. 
Some of the X/s may, of course, be equal; the vibrational wave function, 
aside from a normalizing f actor, may be written as 

K. n„ ».-.»,» = r-*f*aiH» t <VZ t Q i ) 

i 

where H n( is the Hermitian polynomial of degree n< in (V a, Q,). 

According to the discussion in Chapter VIII, a transition from the 
state „ IW to the state . n> ' w is possible only if 

lArii, n„ • • • n lN X4 / n[, njjy &T 0 
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(i» this general discussion, x = x, y, or z ). But, according to 
section lOd, this integral will be zero unless the direct product 
rWv-.naJ r(^ n ;... n ;j is identical with T(x). This is a general 
statement of the selection rule for vibrational transitions in polyatomic 
molecules; all the selection rules for the appearance of fundamentals, 
overtones, or combinations can be derived from it. Here, however, we 
shall be interested only in the selection rules for the fundamental fre- 
quencies, that is, for transitions of the type 

$0 j. 0 2 , • • • Ojjv K. o 2t * • * u, • • * o 8iV 

The exponent in \f / ni . . . nzN has the same form as the potential energy, 
and hence is invariant with respect to every symmetry operation. The 
function ^ will thus belong to the same representation as the product of 
the Hermite polynomials. We therefore have the results 

T(t 0l , o 2 , • • • o 3A r) = Ti (the totally symmetrical representation) 

r(^ 0l , = r(Q,-) 

so that the selection rule for the appearance of fundamentals in the infra- 
red is: The frequency vi is infra-red active if T(Qi) = T(x), T(y), or 
T(z) y where T(Qi) is the irreducible representation to which the correspond- 
ing normal coordinate Qi belongs. 

According to section 8g, a frequency v a b is Raman active if one of the 
matrix elements of the type (a\xy\b) is different from zero. Proceeding 
as above, we therefore have the selection rule for the appearance of fun- 
damentals in the Raman effect: The frequency v{ is Raman active if 
V (Qi) = r(x 2 ), rty 2 ),r(z 2 ), r (xy), T(xz) 1 or T(yz) } where T(Qi) is the 
irreducible representation to which the corresponding normal coordinate 
Qi belongs . 



i 

i 


Fig. 14-2. Symmetry properties of the H 2 O molecule. 

We shall now proceed to the discussion of several examples. 

Example 1: H 2 0. H 2 0 belongs to the symmetry group C 2v ; the 
symmetry elements are the identity £7, C 2 (rotation about the z axis by 
180°, the xz plane is taken to be the H — 0 — H plane, the z axis goes 
through the O atom), <r v (reflection in the xz plane), (reflection in the 
yz plane) ; in Figure 14-2, the 0 atom is above the plane of the paper, 
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the hydrogen atoms are in the plane of the paper. The character table 
for this group is reproduced in Table 14*5; the character of the represen- 
tation generated by the possible motions of the molecule is included in 
this table. The character of this representation is found as follows. 


TABLE 14*5 

C 2 *(H 2 0) 






E 

c* 

a v 


x i , y 2 , z 2 


z 

Ai 

1 

1 

1 

1 




a 2 

1 

1 

-1 

-1 

xz 

Ry 

X 

B 1 

1 

-1 

1 

-I 

yz 

ft. 

y 

b 2 

1 

-I 

-1 

I 




r„ 

9 

-1 

3 

1 


Each atom can move in any direction. We therefore imagine vectors 
x», 7i, attached to each atom i, these vectors representing the dis- 
placements of this atom from its equilibrium position, and see how these 
vectors transform under the operations of the group. The operation E 
leaves each vector unchanged; therefore x(E) = 9. The operation C 2 
interchanges the hydrogen atoms, so that the contribution to xiCz) 
from these atoms is zero. For the oxygen atom, z — > z, y->— y, 
x — > —x under this operation, so that x(C 2 ) = — 1. c[ interchanges the 
hydrogen atoms, for the oxygen atom z — » z, y — »y, x — ► — x, so that 
x(0 = 1. For a v z — > z, y-+— y, x-*x for each atom, so that 
x(<Tv) = 3. Breaking the reducible representation down into its irre- 
ducible components, we have 

= 3 A x +A 2 + 3 B x + 2 B 2 

Certain of the normal coordinates which belong to these irreducible 
representations represent translational and rotational motion. Since 
the vectors representing translational motion transform in the same way 
as the coordinates x , y , and z , and the vectors representing rotational 
motion transform in the manner given in the table, we have for the 
representation which has these motions as its basis 

r*. r = Ai + A 2 + 2 Bi + 2B 2 

Subtracting T it r from T m> we obtain 

r vi b ** %Ai + Bi 

as the representation which has the vibrational motions as its basis. 
There are therefore three distinct vibrational frequencies for H 2 O. 
Two of the normal coordinates associated with these frequencies belong 



278 THE PRINCIPLES OF MOLECULAR SPECTROSCOPY 

to the totally symmetric representation A x ; one belongs to Si, and this 
normal coordinate changes sign when the hydrogen atoms are inter- 
changed. In order to find the actual normal coordinates we would have 
to solve the secular equation with some assumed form of the potential 
energy. We can, however, find by inspection certain coordinates which 
have the necessary symmetry properties. If, for simplicity, we assume 
that the mass of the oxygen atom is infinite, a possible set of such 
coordinates would be 

Qi = a f <p + i/(r x + r 2 ) 

Q2 = c'<p + d! (r x + r 2 ) 

Qs = e'(r t - r 2 ) 


where <p is the H — O — H angle and r x and r 2 are the O — H distances. 
(Coordinates of this type must always be chosen in such a way that they 
do not introduce any translational or rotational motion.) With these 
coordinates, the secular determinant, which would in general be a three- 
row determinant for H 2 0, is broken down into the product of a two-row 
and a one-row determinant, since the products Q 1 Q 3 and Q 2 Q 3 may not 
appear in either the kinetic or potential energies. Solution of the deter- 
minant, with an assumed potential energy, would then lead to the 
proper linear combinations of Q[ and Q 2 which are the actual normal 
coordinates. When this is carried out, it is found that one of the normal 
coordinates belonging to A x represents (approximately) a stretching of 
the 0 — H bond; the other represents a bending of the O — H bond. The 
normal coordinates are thus approximately 

Qi = Q*P Q2 = 6(r x + r 2 ) Q 3 = c(r x - r 2 ) 


The modes of vibration associated with these normal coordinates are 
illustrated in Figure 14*3. Actual calculations involving the solution 

A of the secular determinant are 

A , Qi.= a<t> 


A 


rather difficult and are carried 
out only if one wishes to find the 
force constants in some assumed 
potential energy expression. By 
inspection, diagrams of the type 
Qs^ c ( r i- r 2 ) in Figure 14-3 can usually be 

_ „ . , . drawn. These will not in gen- 

Fig. 14*3. Normal modes of vibration of H 2 0. , , . & 

eral represent normal coordi- 
nates but merely coordinates with the proper symmetry. 

From the selection rules for the infra-red transitions and Raman 
scattering, and the transformation properties of the coordinates and 


4i 
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products of the coordinates, we see that in H 2 0 all fundamentals are 
allowed in both spectra. 

In studies of the Raman effect, the degree of polarization of the Raman 
lines is usually measured; this information often aids in assigning the 
observed frequencies to particular modes of vibration of the molecule. 
If the incident light is traveling in the —y direction and the scat- 
tered light is observed in the x direction, then, if the incident light 
was polarized in the z direction, the degree of depolarization is defined 

by the equation p = ~~ , where 7 (y) and I (z) are the observed inten- 
1 ( z ) 

sities of light polarized in the y and z directions, respectively. The 
theory of the polarization of Raman lines is given by Kohlrausch. 8 
If the incident light is unpolarized, then the degree of depolarization p n 

2 p 

of the scattered light is given by p n = — — . The degrees of depolari- 

1 + p 

zation are given by the following rules: 

1. If the matrix elements of x 2 , y 2 , z 2 are all zero, then 


P = f, Pn = 

2. If not all the matrix elements of x 2 , y 2 , z 2 are zero, then 

0<p<f, 0 < Pn <f 

For H 2 0, we see that the frequencies belonging to A\ have p n < fy; the 
frequency belonging to B\ has p n = 

Example 2: Acetylene. In Table 14*6 we have reproduced the 
character table for the group Dooh f to which acetylene belongs, and have 


TABLE 14*6 


£U(C 2 H 2 ) 




E 

2 C" 

Ci 

iE 

2 iCy, 

iCi 

x 2 + y 2 , z 2 



Alg 

1 

1 

1 

1 

1 

1 




Ai u 

1 

1 

1 

-1 

-1 

-1 




Aty 

1 

1 

-1 

1 

1 

-1 



z 

A2u 

1 

1 

-1 

-1 

-1 

1 

(**, yz) 

(R&t Ry) 


Ei g 

2 

2 cos ip 

0 

2 

2 cos <p 

0 



(x, V) 

Eiu 

2 

2 cos <p 

0 

—2 

—2 cos# 

0 

(*’ - y\ xy) 



E<i 0 

2 

2 cos 2 <p 

0 

2 

2 cos 2# 

0 



1 

E%u 

2 

2 cos 2 <p 

0 

-2 

-2 cos 2 p 

0 




r m 

12 

4 -f 8 cos 

0 

a 

0 

4 


* 9 K. Kohlrausch, tier Smekcd-Raman Effekt , p. 27, Springer, Berlin, 1938. 
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included in this table the character of the representation which has all 
possible motions of the molecule as its basis. For the operation C p , 
all z coordinates are unchanged, giving a contribution 4 to x (<?*,)• The 
matrix of this transformation for the x and y coordinates has the charac- 
ter 2 cos <p, so that these coordinates give a contribution 8 cos <p to 
x(C^). The characters for the other operations are found in a straight- 
forward manner. Breaking the reducible representation down into its 
irreducible components, we have 

r m = 2Ai g + 2A2u + 2j E i g + 2E\ U 
Subtracting the representations for the external motions leaves 
Fvib = 2Alg + i4.2 M + Elg + Elu 

so that there are two non-degenerate modes of vibration belonging to the 
totally symmetric representation Ai 0 , one non-degenerate mode which is 
antisymmetric to inversion, and two doubly degenerate modes. In 



0-*— 

— 0— 

— ** A^ g 

R. 

p< a 
h n 7 


— 

— 

---*■• A ip 

R. 





Ao 

l.R. 










t E U 

R. 

p n =f 


-s_ 

— i — 

• E 1 « 

I.R. 



Fig. 14-4. Normal modes of vibration of C 2 H*. 


Figure 144 we present a schematic representation of possible coordi- 
nates having the required symmetry, as well as the selection rules for the 
corresponding frequencies. 

Example 3. Benzene. The results of a similar calculation for 
benzene are given in Table 14-7. In determining the characters of the 
reducible representation for the operations C 2 and C 2 it is convenient to 
let the axis of rotation be one of the coordinate axes x or y. Benzene is 
seen to have twenty frequencies, of which ten are doubly degenerate. 
Of these frequencies, those associated with the representations E\ u 
and A 2u are infra-red active, so that the infra-red spectrum should con- 
tain three degenerate frequencies and one non-degenerate frequency* 
The frequencies associated with E lff , E 20J and A\ g are Raman active, so 
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that the Raman spectrum should contain five degenerate and two non- * 
degenerate frequencies. The other frequencies do not appear (as funda- 
mentals) in either case. 

TABLE 14*7 


Dafc (CeHe) 

E C 2 2 C z 2 C 6 3 C' 2 SC* iE iC 2 2 iCz 2iC* 3 iC' 2 MC* 


X 1 + y 2 , ** 


WM 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 



Wm 

1 

1 

1 

1 

1 

1 

-1 -1 

-1 

-1 

-1 

-1 


ft. 

JSM 

1 

1 

1 

1 

-1 

-1 

1 

1 

1 

1 

-1 

-1 


Z 

Hlf 

1 

1 

1 

1 

-1 

-1 

-1 -1 

-1 

-1 

1 

1 



Bn 

l -1 

1 

-1 

1 

-1 

1 -1 

1 

-1 

1 

-1 



Bli 

1 ~i 

1 

-1 

1 

-1 

-1 

1 

-1 

1 

-1 

1 



B 2 q 

1 -1 

1 

-1 

-1 

1 

1 -1 

1 

-1 

-1 

1 



b 2u 

1 ~i 

1 

-1 

-1 

1 

-1 

1 

-1 

1 

1 

-1 

(** - y\ xy) 


E 2 g 

2 

2 

-1 

-1 

0 

0 

2 

2 

-1 

-1 

0 




e 2u 

2 

2 

-1 

-1 

0 

0 

-2 —2 

1 

1 

0 


(«, V») 

(ft» ft„) 

E\g 

2 —2 

-1 

1 

0 

0 

2 -2 

-1 

1 

0 

H 


(x,y) 

E\u 

2 —2 

-1 

1 

0 

0 

-2 

2 

1 

-1 

0 

n 



r m 

36 

0 

0 

0 

-4 

0 

0 12 

0 


0 

4 


r m =* 2Ai e 4 2 A 2g 4 2 A 2« 4 2B\ U 4 2J3 2 0 4* %B%u 
4- 4tE 2 g 4" 2E 2u 4" 2Eig 4* AEit* 

Pvib = 2Aj^ 4- A 2g 4* A 2tt 4 2Bx w 4- 2 4- 2 B 2u 
4 4 E 20 4- 2 E 2u 4 Ei 0 + 3Ei v 


If the molecule has a center of symmetry, then x , y, and z belong to u 
representations while; their products belong to g representations. In 
such molecules, therefore, a frequency cannot appear as a fundamental 
in both the infra-red and the Raman spectrum. 






CHAPTER XV 

ELEMENTS OF QUANTUM STATISTICAL MECHANICS 

Our study has thus far been limited to the discussion of the structure 
and properties of individual atoms and molecules. The determination 
of the properties of a system of atoms or molecules (“ particles ”) from 
a knowledge of the properties of the individual particles requires the use 
of the methods of statistical mechanics. We do not intend to treat this 
subject in any great detail; but it is of interest at this point to see how 
the connection between the properties of individual particles and the 
properties of systems of these particles is made. 

Let us consider a system of n particles, all of the same kind. We 
represent the coordinates of the n particles by the symbols q\, <{2, qz • • • q n , 
where g,- represents all the coordinates necessary to specify the state of 
the ith particle. We assume that the Hamiltonian operator H of the 
system can be expressed in the form 

H - H(ji) + H ( ft ) + • • • + H(f») 15-1 

If we represent the wave function for the system by ^ n , then the energy 
levels E n of the system are given by the solutions of the equation 

H*„ = E n i n 15-2 

Because of the assumed form of H, this equation may be separated into 
the n equations 

H (9l )<f>a{qi) = *a<f>a(qi) 

H (52)^6 ($2) = tVPbiqz) 

15-3 


^■{qn)<Pm(Qn) (Qn ) 

The function $ will be expressible in terms of the <p’s; we must, however, 
take a combination of the <p's which will give ^ the proper symmetry. 
These symmetry requirements lead to three possible cases, which we now 
discuss. 

15a. The Maxwell-Boltzmann Statistics. We consider first the 
classical case, where our system contains n distinguishable particles. In 
this case there are no symmetry restrictions on any combination of 

282 



THE MAXWELL-BOLTZMANN STATISTICS 


283 


the p’s such as 

An = Po(?l)p&(?2) • * • Pm(3n) 154 

will be an acceptable wave function for the system. Since the particles 
are distinguishable, any interchange of particles among the occupied 
states <p a , <pb • • • <f>m will lead to a new state for the system. 

Let us now divide the p’s into the groups 1 , 2 • • • k • • • so that the 
eigenvalues for all the p’s in the fcth group lie between the limits «* 
and tk + dtk, and let us suppose that there are p* p’s in the kth such 
group. We now ask the question: How many A’s correspond to a dis- 
tribution of the n particles such that there are ni in the energy region 
corresponding to group 1, n 2 in the energy region corresponding to 
group 2, etc. ? (If a particle is in the kth energy region, it may be con- 
sidered to have the energy «*.) In order to answer this question, let us 
calculate the number of different ways in which we can distribute the n 
particles among the various regions so that there are nj particles in 
region 1, etc. For simplicity, let us consider that we have only two such 
regions. We readily obtain the following table: 

n n\ ri2 N 

10 11 

1 0 1 

2 0 2 1 

1 1 2 

2 0 1 

3 0 3 1 

1 2 3 

2 1 3 

3 0 1 

4 0 4 1 

1 3 4 

2 2 6 

3 1 4 

4 0 1 

(N = number of ways of obtaining the given distribution.) In order 
to see how this table was formed, consider the case where n — 4, = 2, 

n 2 = 2. If the particles are numbered from 1 to 4, the six possible 
distributions are: 

Region 1 Region 2 
12 3 4 



284 ELEMENTS OF QUANTUM STATISTICAL MECHANICS 


Since the order in which we select the particles which we are going to put 
into a given region is immaterial, the designations 1 2 and 2 1 are equiva- 
lent and are counted only once. It is to be noted that, in each of the 

Til 

cases listed, N„, = — : — ; (0! defined to be 1). If the above table 

* «i!n2! 

is extended to include more particles or more regions, we find that the 
same expression holds. The number of ways of distributing the parti- 
cles among the different regions will therefore be 


N n 


n! 


nj, n s • • • 


Hi ln 2 ! 


n k l 


15-5 


We now wish to calculate the number of corresponding to the above 

distribution. We denote this number by G n ,n lf n 2 .--n k Since 

there are g k <p s in the &th region, each of the n k particles can be in any 
3ne of the g k different <p’ s, since there is no restriction regarding the 
lumber of particles that can occupy a given <p. The n k particles can 
thus be put into the g k states in g k k different ways, each corresponding 
bo a new This gives us the relation 


G n , 


? . . . n* • • • « fi'fi 2 


gT 


N n, n lt n 2 •••«*•• 


15-6 


)r 


Gn t n lt n 2 » • • n k • • . - II 


nig? 

n k \ 


15-7 


This is the answer to our first question. We now ask the additional 
luestion: If our system contains a fixed number of particles n, and a 
ixed total energy E , what distribution is the most probable? In order 
o answer this question, statistical mechanics makes the following 
assumption: Each state of the system which is consistent with the 
equirements that n = constant and E = constant has the same 
t priori probability; that is, any two such states have equal chances of 
ccurring. It therefore follows that the probability of the occurrence of 
, given distribution is directly proportional to the number of 
tates representing such a distribution; if we call this probability 
*», »j • • * «* • • • t then 

n, t • n k •mm = CGn, A| ( n 2 ... ti k • • • 15*8 

rhere C is a constant. If we consider P to be a function of the 
hen the most probable distribution will be that for which P is a maxi- 
lum, or that for which SP « 0 (where the variation is with respect to the 
k ’s), subject, of course, to the restriction that 5n = 0 and SE = 0. 
ince log x is a maximum when x is a maximum, it will be equally valid 
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and more convenient to determine the distribution for which 6 log F « 0. 
According to 15-7 and 15*8, 

log Pn. », . . . = log c + log n! + 5>* log g k - E log n k \ 15-9 

k k 

For x sufficiently large, Stirling’s formula states that log xl = 
x log x — x. Using this relation, the condition S log P = 0 gives 

flh 

E log — 5n* = 0 15-10 

* 9k 

Equation 15-10 must be solved subject to the restrictions 

Sn = E^* = 0 15-11 

k 

SE = E«*«n* = 0 15-12 

k 

The desired solution can be obtained by the method of Lagrangian multi- 
pliers; we multiply 15-11 by the parameter a, 15-12 by the parameter 0, 
and add the three equations, obtaining 

E ( log - + a + 0a* ) Sn k = 0 15-13 

k \ 9k / 

The variations 5n k may now be considered to be arbitrary; therefore 
equation 15*13 can hold only if 

Tlk 

log — + a + pe k = 0 15-14 

Qk 

for all values of k. The most probable distribution will thus be that for 
which each rik satisfies the equation 

n* - g k e~ a e- pa 15-15 

Equation 15-15 is correct only when each n* is large enough so that the 
Stirling formula does not introduce any error. When this approximar- 
tion cannot be used, the most probable distribution would have to be 
computed directly from 15-9. The systems ordinarily considered in 
chemistry contain a sufficiently large number of particles to make the 
Stirling formula adequate. 

15b. The Fermi-Dirac Statistics. If our system contains n indis- 
tinguishable particles, and we require that the total wave function ^ 
be antisymmetric with regard to interchange of two particles, we are led 
to the Fermi-Dirac statistics. As we saw in Chapter DC, an antisym- 
metric wave function can be represented by a determinant, or by the 
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* - * • •*■(*•)} 15-16 

This method of writing is equivalent to stating that no state <p may 
contain more than one particle. Since the particles are indistinguish- 
able, it does not matter which particle we choose to occupy a given state; 
the number of ^’s corresponding to a given distribution will be equal to 
the number of ways in which we can select occupied ^'s for this distribu- 
tion. As before, we divide the <p’s into groups of essentially equal 
energy. If there are g k <p’s in the 4th group, then the number of ways 
N 0kt nk in which we can select n k occupied <p’s is illustrated by the 
following table: 


Qk nk 

2 0 

1 
2 

3 0 
1 
2 
3 

4 0 
1 
2 

3 

4 

5 0 
1 
2 

3 

4 

5 


N gkt nk 

1 

2 

1 

1 

2 + 1 - 3 
1 + 2-3 

1 

1 

3 + 1-4 

3 + 3-6 
1 + 3-4 

1 

1 

4 + 1-6 
6 + 4-10 
4 + 6-10 
1 + 4-5 

1 


Since a given <p can be occupied by not more than 1 particle, n k < g k . 
The method of forming the above table can be illustrated by the case 
g k = 5, n k *= 2. If the first <p is empty we have 2 particles to be divided 
among 4 states, which, according to an earlier section of the table, can 
be done in 6 ways. If the first <p is occupied, we have 1 particle to be 
distributed among 4 states, which can be done in 4 ways, so that, for 
g k = 5, njk = 2; N gkt nu = 6 + 4 * 10. In a similar manner, each line 
of the table can be obtained from some preceding lines, so that the entire 
table can be built up from the trivial case g k — 2. It is to be noted that 
in every case 

A 7 _ £*! 

n k \(g k - n*)I 


16-17 
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a result which is perfectly general. The number of ways in which a given 
distribution can occur will evidently be equal to the product of the num- 
ber of different ways that we can select occupied states in the various 
regions, that is, 

G n , »i. «.-••«*•• • = P n> = 15-18 

The remainder of the analysis for the Fermi-Dirac statistics is identical 
with that for the Maxwell-Boltzmann case. We find that the most 
probable distribution is that for which 


* e a e? ,k + 1 


1519 


15c. The Bose-Einstein Statistics. If our system contains n indis- 
tinguishable particles, and we require that the total wave function ^ be 
symmetrical with respect to the interchange of two particles, we obtain 
the Bose-Einstein statistics. A symmetric wave function may be repre- 
sented by the linear combination 

' ^ • # * Vm{qn)\ 15*20 

p 

For this case of symmetric wave functions, there is no limitation on the 
number of particles which we can put in a given state <p . Aside from 
this difference, the procedure is identical with that followed in the Fermi- 
Dirac statistics. The analogous table is: 


Ok n k N 0k> ni 

2 0 1 

1 2 

2 3 

3 4 

4 5 


1 ™ 1+2 = 3 

2 1 + 2 + 3 - 6 

3 1+2 + 3 + 4 = 10 

4 1+2 + 3 + 4 + 5 = 15 


4 0 1 

1 1 + 3=4 

2 1+3 + 6-10 

3 1+3 + 6 + 10 = 20 

4 1 + 3+6 + 10 + 15 = 35 
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For g k =* 2, n k = 2, the first state can contain either 0, 1, or 2 particles, 
the remainder being in the second state, giving 2V 2 , 2 = 3, etc. The 
section of the table for g k = 3 can be obtained from that for git = 2 as 
follows. For gk — 3, n k = 4, the first state can have either 4, 3, 2, 1 , or 0 
particles in it. If the first state has 4, the remaining 0 particles can be 
distributed among the remaining 2 states in 1 way; if the first state has 
3, the remaining particle can be distributed among the remaining 2 
states in 2 ways ; if the first state has 2, the remaining 2 particles can be 
distributed among the remaining 2 states in 3 ways; etc., so that 


N». 4 


In each case 


l+2 + 3+ 4 + 5 = 15 = £AT 2i< 

t-0 


_ ( n k + gt — 1 ) ! 
Qi ' n * n k \(g k - 1)! 


15-21 


so that 


G n 


tt (n k + g k - 1)! 
* n*!(flb-l)! 


15-22 


The completion of the analysis shows that, for Bose-Einstein statistics 
the most probable distribution is that for which 


n k = 



15-23 


The expressions forn* (15-15, 15-19, 15-23) contain two parameters, 
a and 0. The value of a may be determined by means of the require- 
ment that 

= n 15-24 

k 

The parameter /3 may be evaluated by calculating some property of the 
system, such as, for example, the pressure if our system is a perfect gas, 
and comparing the calculated with the observed value. In this way it is 

found that, for each of the three statistics, where k is the Boltz- 

A '1 

mann constant and T is the absolute temperature. For all actual 
systems (except electrons in metals and gases at temperatures very close 
to the absolute zero), e a (P tk » 1 . The term ±1 in the denominators of 
15-19 and 15-23 may thus be neglected; the three statistics then give 
the same result 

n k = g k e~ a e~ fi,> = g k e~“e kT 


15-25 
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• k 

Since Y, n k = e~ a Y<Qk e ~ kT = » 

A: A 


we have 


n 





tk 


Egke kT 


15-26 


as the general expression defining the most probable distribution. 
Equation 15*26 may with equal validity be regarded as the probability 
that a given particle will be in the state with energy 6*. 

15d. The Relation of Statistical Mechanics to Thermodynamics. 
The relation between the energy levels of a system and its thermo- 
dynamic properties may be simply derived by the following argument. 
Let us regard the “ particle ” in the above discussion not as a single atom 
or molecule but as, let us say, a mole of any chemical substance; that is, 
we now regard our “ system ” as a “ par tide/ 7 An analysis identical 
with that presented above then leads to the result that the probability 


that the system be in the state with energy Ei is equal to 


Ei 



Ei * 


The 


2> 

i 

average energy of the system, the thermodynamic internal energy E , is 
therefore given by the expression & 



i 


15-27 


and the specific heat at constant volume will be 




15-28 


If we now calculate the entropy of this system, we have 

5 - s ° - C r dT - fjf «• 


By integrating by parts, we obtain 

G? ■<* V *)). + * SXtt log iT 

= f + * log fee'll - k (log {s>~ **}) 

1 i \ i /Tm o 


15-30 
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The constant term may be identified with So", we note that 
So = k log g 0 , where go is the “ statistical weight ” of the ground state, 
that is, the number of eigenfunctions corresponding to this state. This 
is the statistical-mechanical formulation of the third law of thermo- 
dynamics. If, as is usually true, the ground state of a system is non- 

E A 

degenerate, then S 0 = 0. Since S = — — — , we have the final result 


A = —kT log Z 


15-31 


where 

Z = Y. e ~^ 

n 

and the summation is over all energy levels E n corresponding to the 
allowed eigenfunctions of the system. The effect of phase transi- 
tions is the addition of the same term to TS and E; equation 15*31 is 
thus perfectly general. The quantity Z is known as the “ sum-over- 
states” or “ partition function” for the complete system. 

Where it is possible to express the energy E n as a sum of terms each of 
which depends on one particle only, it is convenient to write Z as a func- 
tion of the partition functions of the individual particles. If is 

&» = VailRbPbfa) * • • <Pm(q n ) 15*32 

then ^ 

^n-ea(l)+€ b (2)+***+€ m (n) 15*33 

where « a (l) signifies that particle (1) has the energy e a . We now con- 
sider a system where the available number of states v is much greater 
than the number of particles n — a condition which is well satisfied for 
most systems of chemical interest. It will then be extremely unlikely 
that a given tp will be occupied by more than one particle, and the possi- 
bility of the occurrence of such states can be neglected. From 15*33 we 
note that a given value of E n can then be obtained in n! different ways 
which differ only in the numbering of the particles. For the Maxwell- 
Boltzmann statistics, each of these n! ways of obtaining E n corresponds 
to a different function \f/, since the particles are distinguishable. For 
this case, therefore, we may write 

En *(l)+-(2)+ . . . +*(n) 

Z e kT 15*34 

n h (1),«(2) . . . 

where the second summation is over all possible values of «i(l), c»(2), 
etc. For the Fermi-Dirac or Bose-Einstein statistics, where the parti- 
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cles are indistinguishable, the nl ways of obtaining E n according to 15-33 
represent only one wave function rj/. * If we were to form a sum of the 
type 15-34 for these cases, we would have, corresponding to a given value 
of E n , nl terms in the summation on the right which differ only in the 
numbering of the particles. Since our summation is to contain only 
one term for each distinct a summation of the type 15*34 is thus too 
large by a factor nl The correct expression for indistinguishable 
particles is therefore 

En 1 _ «(!)+« (2) + ••»+»(») 

Ze~ kT = - E e kT 15-35 

If the particles are all alike, as we have assumed them to be in this 
discussion, then 

_ «»(D +«i(2) + • *« +<i(n) ✓ _ «i\ n 

E e kT =('£e kT ) =r 15-36 

«(l),«i(2), ••• \ i ) 

where / is the partition function per particle. We have as our final 
results: 


For distinguishable particles 

z =/* 

15-37 

For indistinguishable particles 

z %V 

15-38 


If our system is a perfect gas, the particles are obviously indistinguish- 
able, and the correct partition function for the system is given by 15-38. 
If our system is a perfect crystal, the particles are distinguishable 
because of their fixed positions in space, and the correct partition func- 
tion for the system is given by 15-37. Intermediate systems such as an 
imperfect gas or, more important, a liquid, introduce considerable diffi- 
culty. In the first place, it will not in general be very exact to write the 
Hamiltonian in the form 15-1. If we make the approximation that the 
interactions of the neighboring particles with a given particle can be 
represented by some average potential field, this separation of the 
Hamiltonian can be achieved; the question then arises whether we 
should use 15-37 or 15-38. Since the particles are actually indistinguish- 
able, and since they have a certain amount of mobility, it would appear 

that the correct partition function would be Z =-~zf n , where 

N 

1 < N < nl. It seems likely, however, that any exact treatment of 
such systems must be based upon the use of a Hamiltonian which con- 
tains terms involving the coordinates of more than one particle. 
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15e. Approximate Molecular Partition Functions. The partition 
function for a single molecule, as defined above, is given by the relation 

/ = £ e~^ 15-39 

i 

where the summation is over all the allowed energy levels of the mole- 
cule. Equation 15-39 may alternatively be written 

/ = Z 15-40 

% 

where Qi is the degeneracy or statistical weight of the tth level. To a 
good approximation, e* may be expressed as the sum 

= e%(t) + €i(v) + ct (r) + €j(e) + c<(n) 15*41 

where €,*(£), €*(r), €»(e), c»(n) are the energy levels associated with 

translational, vibrational, rotational, electronic, and nuclear motions, 
respectively. To this approximation, the partition function for the 
molecule becomes 

f~ftfvfrfjn 15*42 

where 

«ttt) __ t i(Q 

ft = Eg.(<)e kT , fv = Y,g<(v)e kT , etc. 15-43 

t % 

We now consider these terms separately. As far as nuclear energy is 
concerned, it may be taken equal to zero. The nuclear partition func- 
tion then becomes f n = g n8y where g nt is the nuclear spin statistical 
weight. Since the allowed rotational levels are dependent upon the 
nuclear spin wave functions, as we have seen in section 14c, it is conven- 
ient to combine the nuclear spin statistical weight with the rotational 
partition function, and write 

Sr ~ frfn = Y.9n.(i)gi(r)e~ kT 15-44 

« 

The partition function for electronic energy cannot be further simpli- 
fied; however, the first excited electronic state is usually so high above 

__ «ifo) 

the ground electronic state that the term e~ u may be neglected in com- 
parison with unity. Then the electronic partition function is Qo(e), 
where g 0 (e) is the degeneracy of the ground state. 

According to section 14e, the vibrational energy of a polyatomic mole- 
cule, to the harmonic oscillator approximation, is 

t(v) = (% + %)hv i -| + (rii + \)hvi -| + (»* + i)hvi, 

™ 0 , 1 , 2 , 3 * * • 


15-45 
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where v{ is the fundamental frequency of the ith vibrational degree of 
freedom and there are k vibrational degrees of freedom. Now 

00 ( n + W )^ hr « nhv 

£ e~ *r = e 2 tr £ e ~ *r 

n —0 n «-0 


I — ( _ biL \ 2 / \ 3 \ 

= e 2tr|i+c *r + Ve **y + \e~^) + . . -} 15-46 


hv 


= e" 2 * 7- 


1 


1 

— ba 

- e kT 


so that the vibrational partition function for a polyatomic molecule, to 
the harmonic oscillator approximation, is 


i **k 

fv = n 

*-i 


hr% 

e ~2kT 


hn 

e ~kT 


1547 


The translational energy levels of a molecule are given by the solution 
of the wave equation for a particle in a box (section 5b). If the mass of 
the molecule is m , and if it is constrained to move in a rectangular box of 
edges a, 6, and c and volume V = abc, the translational energy levels are 


E ~ S^\J + b’ + ?) 


16-48 


1549 


n z , n v , n t = 1, 2, 3, 4 • • • 

The translational partition function is therefore 

00 h*nl 00 **n; 00 h*n[ 

ft~Y* e 8 ma*kT £ ZmWcT £ mc*kT 

Rf *1 Tly *■ 1 f|f — l 

h 2 

Now the quantity — ' s very much less than unity for ordinary 

A 2 n2 

temperatures and reasonable values of a, so that changes only 

slightly as we vaiy n x . For this reason it is permissible to replace the 
summations by integrations, and write 



h*n\ 

e~8 maficT dfi x 



h*nl 

e~8mwtr dn y 



jw»J 

e _ 8mo*kr dn. 


15-50 


according to Appendix VIII, 



(2 rmkT)* a 


15-51 
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so that the translational partition function becomes 
r (2wmkT) H (2irmkT) H Tr 

/. — — — y 


15-52 


Classically, the energy of a system is given by the equation H(p, q) — E. 
The classical analogue of equation 15-40 is therefore 

1 r* g) 

/ = j e kT dpi • • * dp n dq t -- • dq n 15*53 


where the factor — has been introduced so that / may be non-dimen- 
sional. The introduction of this specific factor may be further justified 
as follows. For a particle of mass m moving under the influence of no 
forces, the Hamiltonian function is 

H = (rf + Vl + Vl) 15-54 


so that equation 15*53 gives for the translational partition function the 
result 


i n n /»°° (pi+pv+pd r r r 

ft = ^3 J J J e 2mkT dp x dp y dp z J J J dx dy dz 15*55 


If the particle is constrained to move in a volume V , then 


J J* J'dxdy dz 


V 


Integrating over the momenta, we obtain 


ft = 


(2t rmkT)*V 
H 3 


15*56 


which is identical with 15*52. 

We next consider the rotational partition functions for diatomic 
molecules. According to section 14c, there is no restriction on 
the allowed values of the rotational quantum number J if the nuclei are 
different. If the two nuclei have spins $i and s 2 the nuclear spin 
statistical weight is (2$ x + 1)(2$ 2 + 1). The rotational energy levels 
are 

Ej = &i J{J + l) 15,57 

* 

where 7 is the moment of inertia, and each level is (2 J + l)-fold degen- 
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©rate. The rotational partition function f R is thus 

oo hKT(J+l) 

fa = (2*i + 1)(2* + 1) 2 (2 J+ l)e~ 8 ^* r 15-58 

JmO 


For large values of I and T, we may replace the summation by an inte- 
gration, and obtain 


e 

0 


W(/+l) 

8**IkT (2J + l)dJ 


= ( 2 $! + 1 )( 2$ 2 + 1 ) 


8r 2 IkT 

h 2 


15-59 


When the nuclei are identical, there is a restriction on the allowed J 
values. For hydrogen in the ground electronic state, even J values are 
allowed for parahydrogen, with nuclear spin weight unity; odd J 
values are allowed for orthohydrogen, with nuclear spin weight three. 
If we consider hydrogen as being a single species, which is legitimate at 
high temperatures, we have the rotational partition function 

h*J(J+ 1) 1) 

/* = 3 E (2 J + l)e HE (2 J + 1)« ww 15-60 

J odd J even 


If we replace the suxranations by integrations, then, since the summations 
contain only one-half the possible terms, we obtain, according to 15-59, 
the result 


fa = (3 + 1) 


8 r 2 IkT 
2ft 2 


In any homonuclear diatomic molecule, with nuclear spin s, the statisti- 
cal weight of the ortho states is (s + l)(2s + 1); that of the para 
states is s(2s + 1). The rotational partition function will thus in 
general be 


{(s + l)(2s + l)+s(2s + l)} 


8-^IkT 

2ft 2 


(2s + l) 2 


8 * 2 IkT 
2ft 2 


15-61 


For all diatomic molecules, we may therefore write the rotational parti- 
tion function as 


f R — (2s i + 1 )( 2 s 2 + 


8r 2 IkT 

j <rft 2 


15-62 


where a, the "symmetry number,” is 1 for heteronuclear and 2 for 
homonuclear diatomic molecules; that is, a is the number of indis- 
tinguishable ways of orienting the molecule in space. 



296 * ELEMENTS OF QUANTUM STATISTICAL MECHANICS 


For polyatomic molecules, the analysis is similar to that given above. 
For high temperature, the result is 

, 8tt 2 (8 , ,, 

JR — ^3 + 1) 15-63 


where A, B, C are the principal moments of inertia and where the sym- 
metry number a is again equal to the number of indistinguishable ways 
of orienting the molecule in space; for example, a = 2 for H 2 0; a = 3 
for NH 3 ; <r = 12 for CH 4 and C 6 H 6 . The origin of the symmetry 
number is the same as in the diatomic case; if there are n equivalent 

orientations of the molecule in space, then only — of the possible energy 

n 

levels are allowed by the symmetry restrictions on the wave functions. 

15f. An Alternative Formulation of the Distribution Law. 1 Consider 
some definite atomic or molecular system A which is in thermal equilib- 
rium with a system B composed of s harmonic oscillators. Suppose that 
the combined system A + B possesses a total energy E. What will 
then be the probability that the system A has an amount of energy e 
distributed in some one exactly specified way among the various degrees 
of freedom of A ? 

In order to perform this calculation we make the usual assumption of 
statistical mechanics : Any exactly specified way of distributing the energy E 
in the system A + Bis as probable as any other exactly specified way of dis- 
tributing the same total energy. The probability of a partially specified 
distribution is consequently proportional to the number of exactly 
specified distributions which are compatible with it. Thus the probabil- 
ity of A having the energy e in some exactly specified way is proportional 
to the number of exactly specified ways of distributing the remaining 
energy ( E — e) among the s oscillators of the system B . But this is 


just the number of ways of distributing n = 


(E- €) 


quanta of energy 


among s oscillators, where hv is the energy per quanta. Since the quanta 
are indistinguishable, and since there is no restriction on the number of 
quanta in a given oscillator, the problem is similar to that already met 
in the discussion of the Bose-Einstein statistics. The number of ways 
iV»,« of distributing n quanta among the s oscillators is, analogous to 
Equation 15-21, 

N .(»+*- 1 ) ! .Kft. 

Nn -‘ “ n!(«- 1 )! 15,64 

This result can be derived quite simply in the following way. We count 
‘See E. U. Condon, Phys. Rev., 64. 937 (1938). 



DISTRIBUTION LAW 


297 


the number of different ways in which n quanta and s oscillators can b( 
arranged in a line so that there is always at least one oscillator on the 
extreme right; all quanta which are placed between two oscillators art 
considered as belonging to the oscillator to the right. There are 
(n + s — 1) ! such arrangements. Eliminating arrangements which 
have been counted more than once then leads directly to 15*64. 

Equation 15-64 is an exact answer to our problem but can be simplified 
when n and s are very large numbers. Using Sterling's approximation 

x\ = (27rx) H x c e^ x 15-65 


which is valid for large values of x , equation 15-64 becomes : 

If n » 8 we have 


n(n — l)(n — 2) /s — 


3! 


V 


-y 

n / 


+ 


a result which becomes strictly correct as n — > oo . Also, 


E -f- (s — 1 )hv 


i 


If the average energy per oscillator is we can write 
£+ (s - 1 )hv — sy g 

where g will be much smaller than sy provided that n is large compared 
with 8 and € is much smaller than E. In this case 

(E + (s - pAA *" 1 

hv ) 


(n + s - 1) 


8 — 1 


-Q 




c 


E + (s- Dhv V- 1 
hv ) 




(s - l)(s - 2) 

2 ! 
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_ ( E + (s - 1 )hv S j 
~ + (s — l)hv ^ 


1 - ' + # 


y ' 2ly 3 3!7 3 


s— 1 _ « 

e y 


Therefore 


N n , . « (2r> 


—H . 


(s 


- 1) S -H 
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y = Ce ■> 15-67 


where C is a constant independent of e. But, according to our original 
postulate, Nn t8 is just proportional to the probability P(e) that the 
system A have the energy e in an exactly specified way. Therefore, 
we have 


e 

P(«) = Ce” 15-68 

The constant y in 15-68 (which is the reciprocal of the $ in our earlier 
discussion) can be evaluated by calculating some property of the system 
A (such as the pressure if we let A be a perfect gas) and comparing with 
the experimental results. As mentioned before, such an analysis gives 
the value y = kT , where k is Boltzmann's constant and T is the absolute 
temperature. 



CHAPTER XVI 


THE QUANTUM -MECHANICAL THEORY OF 
REACTION RATES 


16a. Formulation of the General Theory. In previous chapters we 
were largely interested in the application of the principles of quantum 
mechanics and statistical mechanics to the study of those properties of 
chemical systems which are independent of time, that is, to the study of 
structural chemistry. These same principles can be applied successfully 
to the problem of the calculation of rates of chemical reactions. As the 
first step toward the solution of this problem it will be profitable to con- 
sider the method of representing reactions and changes in the state of a 
system generally by means of a geometrical picture. 

The state of any system is described in quantum mechanics by its 
eigenfunction >£( q , t), where t is the time and q represents all the coordi- 
nates which would be necessary, in classical mechanics, to specify the 
positions of all the particles in the system completely. With some 
reservations, these positional coordinates will be termed “ degrees of 
freedom,” although this notion is not so precise in quantum mechanics as 
in classical mechanics. (Since the observable properties of a mechanical 
system are completely determined by the energy of the system, it might 
be said to have only one degree of freedom, even though it consisted of 
many interacting particles. In the following discussion we shall use the 
term only in the sense of classical degrees of freedom.) Given the eigen- 
function SF, we can calculate all the physical properties of the system; if 
R is the operator which corresponds to the property in question, then 



JVr* dr 



161 


is the expectation value of the property, that is, the average value of a 
large number of measurements of the property. If ^ does not represent 
a stationary state, that is, if is a function of the time, these expecta- 
tion values will be functions of the time. 

The reaction between a hydrogen molecule and a hydrogen atom, such 
as occurs in ortho-para hydrogen conversion, constitutes a transition in a 

290 
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three-atom system. The properties of this system are completely deter- 
mined by the Schrodinger equation governing it, that is, by its eigen- 
functions. The system itself may be represented by a point in four- 
dimensional space: three dimensions are required to specify the relative 
positions of the nuclei, one additional dimension is required to specify the 
energy. (We assume here that the motion of the electrons is so rapid 
that the electrons form a static field for the slower nuclear motions.) 
The three internuclear distances r 12 , r 2 3, r 13 can be conveniently used to 
specify the configuration of the system. At some time to this system is 
found in a configuration in which, say, r 12 is much smaller than either 
7*23 or ri 3 ; that is, atoms 1 and 2 form a molecule. The reaction in 
question is represented by a transition from this configuration to one in 
which, say, r 2 3 is much smaller than either r 13 or r 12 . The probability 
of such a transition will be denoted by k. 

As the internuclear distances change in such a transition, the total 
binding energy of the system, that is, the electronic energy of the ground 
state of the system, will also change. The problem of determining the 
binding energy for any given values of the positional coordinates is 
identical with the problem of the calculation of the binding energy of 
stable molecules and is subject to the same limitations. The surface 
which represents the binding energy of the system as a function of the 
positional coordinates is known as the potential energy surface for the 
system; its determination is usually the first step in the theoretical dis- 
cussion of the rate of a chemical reaction. The general features of 
potential energy surfaces may be illustrated by the surface for the reac- 
tion H + H 2 . For simplicity we assume the three hydrogen atoms to be 
collinear; the system is then completely specified by the two coordinates 
r 12 and r 23 . If the calculation of the binding energy is made by use of 
the London formula (equation 13*28), assuming the coulombic energy 
to be 14 per cent we obtain the surface illustrated in Fig. 16* l, 1 where 
the lines of constant energy have been plotted as contour lines. The 
coordinate axes are inclined at an angle of 60° rather than 90°, since for 
this particular system this choice of axes diagonalizes the kinetic energy; 
that is, the kinetic energy is expressible as 2 

T -4(i Hf/t 

where m is the mass of a hydrogen atom. The equations of motion of 
the system, from the classical viewpoint, therefore represent the friction, 

1 H. Eyring, H. Gershinowitz, and C. E. Sun, J. Chem. Phys., 3 , 786 (1935). 

1 S. Gladstone, K. Laidler, and H. Eyring, The Theory of Rate Processes , p. 100, 
McGraw-Hill Book Company, 1941. 
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lees sliding of a mass point of mass m on the surface of potential energy. , 
(Strictly speaking, the analogy is not exact, since the motion of a mass 
point on a gravitational potential surface takes place in three dimensions; 
the discrepancy is not serious for our purposes.) We note that the 
potential energy surface is made up of two long narrow valleys, represent- 
ing the stable H 2 molecules, connected by a region of higher energy and 
separated by regions of still higher energy. The potential energy sur- 
faces for more complicated reactions are similar. The regions eorre- 



Fig. 16-1. Potential energy surface for three hydrogen atoms (after Eyring, 
Gershinowitz, and Sun). 

sponding to the configuration of the reactants (the initial state) will 
usually be separated from the regions corresponding to the configuration 
of the products (the final state) by a region of higher energy, in which 
event the reaction has an activation energy . Classical mechanics pre- 
dicts that, if the height of the barrier is V 0 , no systems with energy 
E < V o can react, and all systems with energy E > V o which approach 
the top of the barrier proceed to cross it and lead to reaction. If, as in 
the H + H 2 example, there is a shallow basin with a second pass just 
beyond the first barrier, a system may move into this basin and be 
reflected into the valley from which it came. Figure 16*2, 3 illustrating 
the classical motion of a system entering this basin, is interesting in that 

8 J. Hirschfelder, H. Eyring, and B. Topley, J. Chem. Phys., 4, 170 (1936). 
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it shows that the motion of a particle in the basin tends to become com- 
pletely random, and thus has an equal chance of leaving by either pass. 
In this case only about one-half of the systems which enter the basin 
will leave it through the pass into the valley corresponding to the con- 



Fig. 16*2. Path of representative point on H — H — H potential energy surface 
(after Hirschf elder, Eyring, and Topley). 


figuration of the products. These predictions of classical mechanics are 
modified when the problem is considered from the quantum-mechanical 
viewpoint. In the first place, there is a finite probability that a system 
in the initial state with energy E < V 0 may nevertheless appear in the 
final state at a later time. This phenomenon of “ leakage ” through the 
potential energy barrier is important in the decay of radioactive nuclei 




1 

1 

1 


I 

..1 

jH 

1 

1 

1 

m 


~ V 2 . 0 > y + 


Fig. 16*3. One-dimensional potential energy barrier. 

and in reactions involving only the transfer of electrons. In the second 
place, even if E > V 0 , there is a finite probability that the system will be 
reflected in the region of the barrier and will hence not proceed to the 
final state. 

The method of calculating these probabilities may be illustrated for 
motion in one dimension in the potential field of Fig. 16*3. The poten- 
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tial energy is 

F(x) = 0 — <»< x < — ^ 

2 

V(x) = F 0 - l ~<x< 1 - 16-3 

V(x) = 0 ^<*<+oo 

The Schrodinger equation for the system is 
d?i 8ir 2 m 

-£ 2 +- ¥ -[(E-V(x)}t = 0 16-4 

In region I, where V (x) = 0, i has the form of a plane wave (section 5- 1 ) , 
the two independent solutions corresponding to motion toward the 
barrier and away from the barrier. The general solution in this region 
is then 

ii = Ae™ + Be “*» 16-5 

where a = V2mE, and E is the total energy of the system. In 
h 

region II, where V (x) = V 0 , the solution is of the same form if a is 

replaced by j3 = V2m(E — F 0 ). When E < F 0 , /3 is imaginary, 

h, 

and in corresponds to an exponentially decreasing probability of finding 
the system in the region inside the barrier. In region III, ini has the 
same form as ii, and so we have 

in = Ce ilSx + De~ 0x 16-6 

ini = Fe*** + Ge~^ x 16-7 

The eigenfunction for the entire region must be continuous and have a 
continuous first derivative; therefore the following conditions must be 
fulfilled: 

dii diu l 

ii = in, — = — when x = - - 16-8 

dx dx 2 

d\bi it l 

in - iiih ~r~ = ~ — when x = + - 16-9 

dx dx 2 

These four conditions enable us to determine the constants B, C, D, and 
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F in terms of the constants A and G. The set of simultaneous equations 
is readily found to be 


Be fa 3 - Ce ** - De = -Ae ** 

. i i . i i 

- aBe - pCe = - a Ae “* 

Ce 0 * + = <3 e -< “S 

0Cc 2 - pDe ^ - aFe”* 2 = -a(?e w 2 

whence 


-Ae~™ 2 

— e 2 

— e 2 
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- -aAe ~ U “5 


fie 0 * 
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Ge~ ia t 

e 2 

e 1 
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— e*“2 
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-fie-** 

-ae* 1 * 
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e 2 
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xa-x 
— ae 2 
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.-i 

. 1 

Wo 

— e 1 
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— ae 2 


1610 


1611 


with the analogous expressions for C, D, and F. 

Let us suppose that there are no systems returning from the final 
state; that is, there are no systems in region III with negative momen- 
tum. In this case G = 0. Let the probability of finding a system in 
region I with positive momentum be unity; that is, let A = 1. Then 
\F\ 2 gives the chance that it be transmitted into region III and \B\ 2 
gives the chance that it be reflected back into region I. For E > V o, 
it is seen that the transmission coefficient |F| 2 is small but finite. More- 
over, it is a function of the variable 0 = 0i; that is, it varies with the 
original momentum of the particle, the height of the barrier, and the 
width of the barrier. When E > Vo, the transmission coefficient oscil- 
lates with these variables. Thus, for a fixed barrier height and width, 
particles of certain energies will be completely transmitted, while those 
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of slightly different energies will be less likely to cross the barrier, even 
though they have enough energy. 

The energy surface for an actual chemical reaction is always at least 
two-dimensional and is usually many-dimensional, since it must include 
a dimension for every internuclear distance for all the nuclei involved in 
the reaction. In every case, however, there will be some initial configu- 
ration in which the eigenfunction for the system can be well approxi- 
mated in one dimension by a plane wave traveling toward the region of 
configuration space which connects with the region of the products. 
Also, there will be a region where the eigenfunction for the configuration 
of the system representing products can be well approximated in one 
dimension by a traveling plane wave. Thus it will always be possible to 
expand the exact eigenfunction for the system so that it will represent, 
asymptotically at least, a plane wave which we shall call the transmitted 
wave, traveling away from the activated state down the valley repre- 
senting products. Then this representation will, in general, reduce 
asymptotically in the reactants valley to the superposition of plane waves 
traveling to and from the activated state, that is, an incident and a 
reflected wave. The ratio of the amplitude of the transmitted wave to 
that of the incident wave defines the transmission coefficient. Similarly, 
the ratio of the amplitude of the reflected wave to that of the incident 
wave defines the reflection coefficient. 

It often happens that motion in degrees of freedom orthogonal to 
that in which the reaction takes place is quantized, and the system 
exists in discrete energy levels corresponding to vibrations of the react- 
ants or products. These vibrational quantum numbers may or may not 
change during the course of a reaction. A change in the energy of some 
quantized degree of freedom requires a corresponding change in the 
energy of some other degree of freedom. Thus, reflection or transmis- 
sion at a barrier in the energy surface may induce a transfer of energy 
between the various degrees of freedom of the system. Let \f/ n represent 
the eigenfunction of the incident wave, where n denotes the set of quan- 
tum numbers specifying its vibrational state. Let p n be the trans- 
lational momentum of a wave in the vibrational state n. The reflected 
wave may have any vibrational state consistent with the available total 
energy. Let the eigenfunction of that fraction of incident systems 
initially in the vibrational state n which are reflected in state m be 
Rnm'l'm. Their momentum will be p m . Similarly, let T nm 'l'm be the 
eigenfunction of the wave transmitted with change in vibrational state 

from n to m. Then I^J 2 is the density of incident systems and — 

M 

their velocity. (M is their mass.) Since all systems are either reflected 
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or transmitted, and since there is no piling up in the region of the barrier, 
we have the relation 


l*»l 2 ^ = + £lr«*l 5 K M 


j Pm 


Vk 


16*12 


where the sums are over all values of m and k which are consistent with 
the total energy of the system. When the $’$ are normalized to unity, 

1 2 — = p nm is defined as the reflection coefficient , and the 


the ratio \R n 


Vn 


ratio | T n k\ 2 ~ = ic n k is defined as the transmission coefficient for the tran- 
sition from level n to level fe. 

Let us now formulate the rate of a chemical reaction in terms of these 
coefficients. We consider a region in the reactant valley which is far 
removed from the potential energy barrier. Let C n (p) be the number 
of systems in length of path dx in this region which have momentum 
between p and p + dp along the reaction coordinate and energy E n 
in the other degrees of freedom. The number of such systems which 
pass a given point in the reaction path per second moving toward the 
^71 (p ^ P P 

barrier will be , since — is the velocity of these systems, where 

dx m m 


p is considered to have only positive values, 
systems which react will be 

Cn(p) V 


E*nfe(p) 

k 


dx m 


The fraction of these 


16*13 


where K n k(p) is the transmission coefficient as defined above. Assuming 
an equilibrium distribution in this region, the concentration C n (p) will 
be 

Ui)(A 2 )--- -%$-*dxdp 


Cn(p) = 


F\F 2 


■ o>„e 


16-14 


where (Ai), (A 2 ), • ■ • are the concentrations of the reactants, Fi,F 2 ,''‘ 
are the partition functions for the reactants, and u n is the statistical 
weight of the level with energy E n . The net rate is obtained by summing 
16-13 over n and integrating over the momentum p. The general expres- 
.flion for the rate of a chemical reaction is therefore 


k = 


— / 16-16 
^ 0 h n Wh 

This expression cannot be further reduced unless certain simplifying 


(Ai)(A 2 ) 

F\F 2 
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assumptions are made. We obtain an approximation to the classical 

rp“ 

case if we assume « n k (p) = 0 for — - < Vo, K n k(p) equal to some average 

2m 


value k for > F 0 . Equation 16-15 then becomes 
2m 


k = 


(AiKAg) 

FiF 2 • • 

F 1F2 • • 



E n 

r 00 

/clXe 

* kT 
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kT 
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kT 

IZ-p kT 

h e 


2 mkT 


p dp 
mh 


16*16 


En 

The quantity kT has the general form of a partition function and 

n 

may be denoted by F l . We therefore have for the specific reaction rate, 
to this approximation, the result 


k f — k 


F x kT 


kT 


F l F 2 


h 


16*17 


This is the familiar equation originally derived by Eyring, 4 using a 
slightly different method of approach, and is a generalization of the 
results obtained by Pelzer and Wigner 5 in their study of the H + H 2 
reaction. In the derivation by Eyring’s method, it was shown that F x 
is the partition function for the activated complex (not including the 
degree of freedom along the reaction path). The investigation of the 
specifically quantum-mechanical effects in chemical reactions, as dis- 
cussed later in this chapter, have confirmed the validity of equation 16*17 
for most cases of interest. Most of the applications of the theory of 
absolute reaction rates have been based directly on equation 16*17; 
the results are discussed in detail in the recent textbook by Glasstone, 
Laidler, and Eyring. 2 

It will be noted that in equation 16*15 all explicit reference to the 
nature of the potential energy surface in the region of the barrier has 
disappeared; the effect of the barrier is contained implicitly in the trans- 
mission coefficients K n k (p ) . In this formulation of the theory there is no 
assumption of equilibrium between the initial and activated states. It 
is necessary to assume equilibrium between the various vibrational 
states of the initial configuration, which appears always justifiable, since 
the initial configuration can be chosen in such a way that the interactions 
between the reactant molecules are as small as desired. When reaction 

4 H. Eyring, J. Chem. Phys. } 3, 107 (1935). 

* H. Pelzer and E. Wigner, Z. phydk. Chem. t B16, 445 (1932). 
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induces transfer of energy from translational to vibrational degrees of 
freedom in the system, there will be, as we have seen, a different /c»*(p) 
and hence a different rate for each vibrational level in the final state. In 
general, the distribution of systems over the vibrational levels in the final 
state will not be the equilibrium distribution, but is given automatically 
by adding the rates of reaction for each level. Thus there will be a rate 
k' m for reaction in which the products are in the vibrational level m 
given by 

1 r 00 - p * 7) dr) 

k' m = — / Z« nm (p)u n e kT e 16 . 18 

Integrating this expression with respect to time will give the concen- 
tration of products in the rath vibrational state (disregarding changes 
after reaction) . This will be the equilibrium value only for special forms 

of K. 

The above formulation is completely general and applies immedi- 
ately to bimolecular and higher-order reactions. Unimolecular reac- 
tions require further consideration. They are usually considered to 
involve two steps, an activation by collision followed by a decomposition 
of the activated molecule. The first step, activation by collision, can be 
treated in principle by the method given above. Here we are interested 
in the calculation of the probability that systems approaching the 
barrier will be reflected with a change in vibrational energy. Analo- 
gously to equation 16*18, we have for the rate of formation of molecules 
in vibrational state l the result 

1 P°° in V » „ fa 

k[ = — / £p»i(l>)«»e"* r e HM9 

where p n i(jp) is the reflection coefficient. The rate of deactivation by 
collision is given by an analogous equation. Activated molecules can 
disappear either by decomposition into the products or by deactivation 
by collision. The usual theory of unimolecular reactions, as developed 
by Lindemann, Hinshelwood, Kassel, Rice, Ramsperger, and others, 6 
assumes that the concentration of activated molecules is constant. 
This steady-state assumption leads to the equation 

h(A)(N) = k 2 (A) (N*) +h(N*) 

where (N) is the concentration of normal reacting molecules, (N*) is 
the concentration of activated molecules, (A) is the total concentration 

8 L. S. Kassel, Kinematics of Homogeneous Gas Reaction , p. 93, The Chemical 
Catalog Co., New York, 1932. C. N. Hinshelwood, The Kinetics of Chemical Change 
Oxford Clarendon Press, 1940. 
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of molecules which can cause activation or deactivation by collisions. 
ki and k 2 are the specific rates for activation and deactivation by colli- 
sion, respectively, and k 3 is the specific rate for reaction of the activated 
molecule. The actual rate of production of the products will be 


k! = k z (N*) 


h h(A)(N) 
k 2 (A) + fc 3 


or, in the high-pressure limit 


k' = k -p (N) 

A?2 


The rate constants fci and k 2 are calculable by the equation given above; 
however, in the high-pressure region, where only a negligible fraction of 
activated molecules disappear by reaction, we may write 

fci _ (N*) _ F N * 

h (AO F n 

where Fn* is the partition function for activated molecules and Fn is 
the partition function for normal molecules. According to equa- 
tion 16-17, may be written as 

7 kT F n X 

h = k— — 


where FnX is the partition function for the activated complex of the 
reaction AT* — > products. (In FnX and Fn * the zero of energy is taken 
to be the ground state of the normal N molecules.) The high-pressure 
rate is therefore 


k' = 


k 2 


v* ) 


Fn h Fn* 


that is, all explicit reference to the activated molecules N* has disap- 
peared, and we are back to our familiar rate equation. 

There are two alternative ways of formulating the reaction rate prob- 
lem which are more useful for some purposes. In practice it is not 
possible to obtain the exact eigenfunctions for an n-body system when 
n is greater than 2 because of the fact that the equations of motion are 
not separable. They can be computed approximately, however, by first 
calculating the eigenfunctions for an approximate Schrodinger equation 
which is separable and then treating the terms which were neglected in 
order to make the separation as a perturbation on these approximate 
solutions. It is then found that the density of systems in any particular 
unperturbed level is a periodic function of the time. The perturbation 



310 


THEORY OF REACTION RATES 


induces transitions from one approximate level to another. For uni- 
molecular decompositions the approximate levels are those of the mole- 
cule which is decomposing, and the lifetime of an energy-rich molecule 
is determined by the probability with which transitions to continuous 
levels (corresponding to dissociation) are induced. Rosen 7 has used 
this method to calculate the mean life of a linear triatomic molecule. 
For more complicated molecules, even the separable problem is difficult 
to solve. Kimball 8 has treated the question classically by calculating 
the time elapsed before the amplitude of harmonic oscillations exceeds 
a critical value such that the energy of the oscillation is equal to the 
dissociation energy of the molecule. 

A third method which is often useful in treating problems where the 
rate is determined primarily by leakage through an energy barrier may 
be called the method of beats. If two single minimum potentials are 
connected by an energy barrier, it is found that the energy levels which 
lie below the top of the barrier split into two levels of slightly different 
energy as the width of the barrier is decreased. The amount of split- 
ting is small compared to the energy difference between pairs of levels. 
The corresponding eigenfunctions have different symmetry with respect 
to inversion in the origin. The one corresponding to the lowest level 
of any pair is antisymmetric, that is, it changes sign when the coordinate 
changes sign; the other eigenfunction is symmetric with regard to 
inversion. We then see that the function |^ a + $ 8 \ 2 is located almost 
entirely on the left side of the barrier; the function \\f/ a — ^«| 2 is located 
almost entirely on the right side of the barrier. The subscripts $ and a 
refer to the symmetric and antisymmetric eigenfunctions, respectively. 
We therefore represent a system moving from one side of the barrier to 
the other by the eigenfunction 

-?™Et Et 

* = f a e h a + he h ' 16-20 


The density is given by 

*** = + 'Pt'l'.e 


-^P(E.-E a )t 


+ M7e 


2irt* 


(E.-Ea)t 


When t = 0, 

*** = |*. + *.| 2 


and the system is on the left side of the barrier; when i = — » 

2 \E, — E a ) 

*** = \f a _ 


7 N. Rosen, J. Chem. Phys., 1, 319 (1933). 

8 G. E. Kimball, /. Chem. Phys., 5, 310 (1937). 
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and the system is on the right side of the barrier. The frequency 

_ 1 _ 2 (E a - Eg) 

P T h 


is therefore the rate of penetration of the barrier. As the barrier width 
increases, (E 8 — E a ) decreases and v decreases. This method is appli- 
cable only for small splitting of the energy levels and thus cannot 
be used to determine the rate of passing over the barrier. Of the three 
methods presented, the method of transmission coefficients, leading to 
equation 16*15 for the rate, is the most satisfactory from the theoretical 
viewpoint. The applications of the approximate equation 16*17 have 
been adequately discussed elsewhere; we wish at this point to investigate 
the behavior of the transmission coefficients tc n k{p) more closely. 

16b. General Behavior of the Transmission Coefficient. The trans- 
mission coefficients for a number of special types of energy barriers can 
be calculated exactly; this calculation has already been carried through 
for a sharp rectangular barrier. A less drastically idealized case has been 
considered by Eckart. 9 The Eckart potential is 


V(x) = 


AH Bt 

l-s a-*) 2 ’ 5 


16-21 


and the corresponding Schrodinger equation has exact solutions in terms 
of hypergeometric series. This potential is a step function of the coordi- 
nate x; its exact shape is fixed by specifying the constants A, B, and l. 
We have 

F(— oo)=0, V(+*)=A 16-22 


For 0 < |B| < |A|, the potential has a maximum whose height is 


(A + jB ) 2 
45 


16-23 


The rise in potential is accomplished practically in the distance 2Z. 
The Schrodinger equation with this potential is 


dV 

dx 


2+ h 2 


8ir 2 m \ 


+ 




i - r a - i? 


E\ \p = 0 


Changing variable from 2 to $ gives 
dV # 2ml 2 [ M 

_ dl 5 + * df + h 2 U - e 

9 C. Eckart, Phys. Rev., 35, 1303 (1930). 
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(1 - f) 2 


16-24 


+ E\1"~ 0 16-25 
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This equation is of the hypergeometric type and has solutions in the form 


\ • u d(a -f- 1) * b(b + 1) 2 

F (a. b 9 c, y) = 1 *4 y H y 2 4- 

ryii f u f v,yj T t _ y “T j . 2 • c(c + 1) ^ 


Since for large positive or large negative values of x , the potential 16*23 
reduces to a constant, the solutions 16*26 should be asymptotic to plane 
waves with wavelength 

* h * 

X — ■■ — - for X — ► — 00 

v2mE 

and 


for x — » + *> 


V2m(B - A) 

At x = + oo there will be a single transmitted wave: 

2-nix ,l_ 

e v = (-£) v 16-27 

At x = — .oo there will be an incident and a reflected wave: 

2 xis 2t*x .1 A 

a\e x + a 2 e x = ai( — £)* x + a 2 ( — {) * x 16-28 

The requirement that the exact solution reduce asymptotically to the 
forms 16-27 and 16*28 determines it uniquely and also detennines the 

a 2 2 

coefficients cq and a 2 . The reflection coefficient is then p = — • 

di 

A solution of 16-25 which converges for large values of ((x > 1) and 

•I 

reduces asymptotically to ( — £) * x ' as |{! becomes very large is given by 


*«(!-*) 


where 


W4 


+ i(a -0 + 8), 


I + i(a -0-8), 1 - 2i0, 


1 - £l 


l 1 IE l if 

a ~ X ~ 2\ C ’ 13 ~ X' ~2\ 


E - A 

C 




; c = 


That this reduces to (— £)*^ for large £ is evident since F(a, b,c, 0) = 1. 
Its value for small £ cannot be determined immediately, however, since 
F(a, b, c, 1) does not converge. But it is possible to express 16-29 as the 
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sum of two series which do converge for small values of We thus find 
the analytic extension of in the region where its representation by 
16-29 does not converge. 

Using the known formula, we get 


* = 




+ *(« -/* + «), 


~h + »(« — j 8 — S), 1 + 2 ia, 


+ (j-ri) ~ (i - «)**■ {i + *(-« - 0 + *), 

— 2 + *"(—<* — — 5), 1 — 2ta, 

where 


r(l - 2tj?)r( — 2ia) 

r{| + *(-« - /} - 5)jr(i + *(-« — /» + «)} 

r(l - 2ifi)T(+2ia) 

r{J + *(« - /3 - a) } r{£ + t(« - £ + a) } 


f- l 

16-31 


16-32 


For small values of £, equation 16-31 reduces to equation 16-28 with the 
values 16-32 for ai and a 2 . The reflection coefficient is then given by 



G>2 

2 

r{£ + »(5 — 0 — a)}r{^ + i(— 5 — f) — a)} 

p = 

(Li 


r{f + t(i - J8 + a)}r{| + <(- 6 - fi + a)} 


When the constant C is small, 5 is real and applies when the wavelength 
of the incident particle is short compared to the region where potential is 
changing. If the wavelength is long compared to the length l, C will be 
large and 5 will be imaginary. The expression for p is expressed differ- 
ently in the two cases. For 5 real, we have 

cosh 2x(a — |8) + cosh 2x5 

— . N * ' 1 O i 

cosh 2r(a + fi) + cosh 2w8 


For 5 imaginary, 

cosh 2r(a — ft) + cos 2 tt|5| 

P cosh 2w(a + P) + cos 27r|s| 

The transmission coefficient is given by k — 1 — p. 

This value of p or k may be used for any potential barrier which is 
sufficiently well represented by equation 16*23. The general method, of 
course, is applicable whenever the exact solutions of the Schrodinger 
equation corresponding to the given potential are known. However, 
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many barriers are flatter than that represented by equation 16*23. The 
question then arises: Under what conditions is it permissible to treat the 
rising side and the falling side of the potential as independent changes, 
each having a reflection coefficient given by 16*34 or 16*35? This 
question has been investigated by Hirschf elder and Wigner. 10 Any 
irregularity in the potential surface, besides inducing transitions in the 
reflected and transmitted waves, also sets up non-propagating “ diffrac- 
tion patterns ” which may be considered to be exponentially damped 
standing waves in the vicinity of the irregularity of the potential. If 
the top of the barrier is wide enough so that these diffraction patterns 
do not overlap, the transmission coefficients at the two edges may be 
considered independent. The progressive wave representing the react- 
ing system on the top of the barrier then loses all traces of the effects of 
one edge before it reaches the other. It then exists in a fairly well- 
defined quasi-stationary state while it is on the top of the barrier. 
Under these conditions, it will be permissible to treat the activated 
complex as a definite molecule with properties which can be calculated 
from the shape of the potential at the top of the barrier. The potential 
surface must be flat enough so that the density of reaction complexes, as 
given by the Boltzmann factor, is sensibly uniform over it, and the flat 
portion must be wide enough so that the uncertainty in the velocity due 
to the relationship Ap Ax ~ h is negligibly small. If the potential along 
the reaction path through the activated state is approximated by the 
parabola 

V = V 0 -ax 2 

these conditions may be expressed in the inequality: 

hn ‘~i^ kT 16 - 36 

where M is the mass of the complex. It appears very probable that 
most reactions satisfy this criterion, and it is consequently justifiable to 
treat the transmission coefficients at the two ends of the activated state 
as independent. 

We may suppose that there is a probability p»* that a system entering 
the activated state will be reflected and a second probability p/ that a 
system leaving the activated state will be reflected. The transmission 
coefficient k is the fraction of those systems in the activated state moving 
from the initial to the final state at thermal equilibrium which originally 
came from the initial state and which will proceed directly to the final 
state without returning to the activated state. At thermal equilibrium, 

10 J. Hirschfelder and E. Wigner, J. Chem. Phys., 7, 616 (1930). 
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let there be N systems in the activated state, A of which arrive ip unit 
time from the initial state, N — A =5 of which arrive from the final 
state. Of the A systems, Ap / will be reflected at the boundary of the 
activated state and A (l — p /) will be transmitted* Of those reflected, 
ApiPf will be reflected at the other edge of the activated state and will 

Initial Transition Final 

state state state 


A w ! A A(l-P/) 



Pi Pf 

Fia. 16*4. Calculation of transmission coefficients (after Hirschfelder 

and Wigner). 


recross it toward the final state. Figure 16-4 illustrates the flux of 
systems of various types. Adding up the number of systems crossing 
from left to right, we find the total number to be 

Nt^r = A (1 + PiPf + pbf + •••)+ Bpi( 1 + PiPf + pbf + ••■) 

= (A + Bpi) (1 — ptP/) -1 16-37 

Similarly, the number crossing from right to left is 

N r ->i = (A P/ + £)(!- PiP,)- 1 16-38 


At equilibrium, equal numbers of systems will be moving in each direc- 
tion, that is, 2V{_, r = A r _,j, so that 


B = 


A (1 — Pf) 

(1 - pi) 


16-39 
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Substituting this expression into 16-37 gives 


= 


A 

(1 - Pi) 


16-40 


But the number of systems originating in the initial state which proceed 
to the final state is, by addition, 


Ni^f = 


M 1 -p/) 

(1 - Pip / ) 


' 16-41 


and the transmission coefficient is the ratio 

_ Nj-tf _ (1 — p»)(l — p/) 
~ Ni — (1 - PiPf) 


1642 


This expression for k is the average for a great many systems in thermal 
equilibrium, whose energies consequently differ slightly. The trans- 
mission of a single quantum state will depend critically upon the energy 
of the system, as we shall see, but averaging over a range of energies 
gives an average transmission coefficient which agrees with equa- 
tion 1642. 

We can complete the calculation of k for the Eckart potential. Taking 
B =s 0 reduces the barrier to a simple step of energy A with no maximum 
in V . For this special case the reflection coefficient is 


P = 


2x1 


(p-q) 


12 


- 1 


2xl 


<P+fl) 


- 1 


4 W 

6 A* 


1643 


where p = V2 mE and q = V! 2m {E — A) are the momenta in the initial 
(or final) and activated states, respectively. For an abrupt energy 
change, l == 0, and the transmission coefficient becomes 


4 qPiPf 

(<f + ViVf) (pi + vs) 


16-44 


where q , and p / are the momenta of the system in the initial, acti- 
vated, and final states, respectively. This value for k may be compared 
with the exact result for a rectangular barrier, as determined earlier in 
this chapter. If we take the expression analogous to 164 1 and reduce the 
determinants, we find for the transmission coefficient the result 

VP.?/ 

K Q 2 (Pi + Pf) 2 cos 2 <p + (q 2 + PiP/) 2 sin 2 <p ’ V h 


where d is the barrier width. If this equation is averaged over <p f that is, 
over a range of momenta in the activated state, the result is identical 
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with equation 16-44 for it. * covers a cycle in its oscillation as <p goes 

h 

from 0 to 2v or as q goes from 0 to - . Thus, for wide barriers, the oscil- 

d 

lation in k as the momentum in the activated state is increased becomes 
more rapid, and averaging over a small range of momenta smooths out 
the oscillations more completely. Thus the treatment of p,- and p/ 
as independent is justified in a statistical sense as giving the average 
transmission coefficient over a band of energies when the barrier is 
sufficiently wide. 

In an n-atom system a decrease in momentum as the system passes 
from the initial to the activated state means that some of the initial 
translational energy has been converted to vibrational energy. To 
consider such a possibility, we must treat the problem in more dimen- 
sions. An abrupt increase in potential in a straight channel of para- 
bolic cross section will illustrate the essential features of the two-dimen- 
sional case. Let us designate by X the coordinate along the channel, 
and that perpendicular to the bottom of the channel by x. We consider 
an abrupt change in the potential at X = 0, i.e., for X < 0, let F,- — 
A -f- a;x 2 ; for X > 0, let F/ = a/x 2 . In the left-hand region, the 
Schrodinger equation 

H {ipk = h'Pk 

with potential F„ must be satisfied. On the right side of the potential 
drop, the equation to be satisfied is 

H f<pk = FkVk 

with potential F /. The <p ’ s may be expanded in terms of the \p’s as 

<Pk(x) = T,uikMx) 16-46 

An incident wave from the left has the form fi(x)e h 1 and gives rise 

to a reflected beam in which the systems have all possible quantum 
numbers m for their motion in the x direction. Thus the wave function 
for X < 0 is 

2 ri oo 2 t< y 

+ £ RirAme h 16-47 

*»■*() 

and the total energy of the system is given by 
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For X > 0, there is only an outgoing wave 


00 2tt» 

Z Tik<Pke h Pk = Z TikUkmtme h 

k*=0 k, m 


and the energy is 


Pk 

E = — +F k 
2M ~ 16 


1649 


16*50 


For high quantum numbers, the momenta and qk are imaginary, and 
the corresponding wave function represents a local non-propagating 
disturbance in the neighborhood of the discontinuity. These imaginary 
momenta, when divided by i ) must be positive to represent exponentially 
decreasing disturbances. 

Both the wave function and its derivative must be continuous at 
X = 0. Equating 16*47 and 16*49 and their derivatives with respect 
to X at X = 0, and comparing coefficients of gives 


dim “f“ Elm — ZTlkUkm 

k 

hmqi ~ Rlmqm ~ Z T Ik^kmVk 

k 


16*51 


These equations may be solved for Ri m and T\k by the methods of matrix 
algebra. The results are 


r = (*- ufc + ir 1 

T = 2£(£ + l^vT 1 


16*52 


where £ is the infinite square matrix £ = qu x p l u. From the wave 
function 16*47 we see that the number of systems with quantum num- 
ber l which are incident in unit time is 77 » that is, density times velocity. 

M 


Similarly the number reflected with quantum number m is 


\Rlm\ 


2 

M 


so 


that the probability of reflection with quantum number m is \Ri m \ 2 — • 

Qi 

In the same way we see that the probability of transmission with quan- 

turn number k is 1 7^ 1 2 — • 

Qi 

Equations 16-52 are only a formal solution, since we have not evalu- 
ated the matrix elements of £ . This can be done for the case where the 
total energy E is much greater than the energy of the highest vibra- 
tional state Ft to which there is likely to be a transition. This means 
that the translational energy in the final state will be large compared 
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with the vibrational energy, though this need not have been true in the 
initial state. The results of this calculation show that the reflection 
coefficients for no change in vibrational quantum number are slightly 
smaller than those calculated for one dimension, owing to the possibility 



Fig. 16-5. Probabilities of reflection and transmission at abrupt drop of 9 kcal. in 
potential energy when vibrational frequency changes simultaneously from hv 4 = 4 kcal. 
to hvj — 2 kcal. (after Hirschf elder and Wigner). 

for reflection with simultaneous change in vibrational quantum number. 
To the approximations made, such changes can be either 0 or ±2 for 
the reflected wave; the transmitted wave may have any quantum num- 
ber differing by an even integer from that of the initial state. 

In Figure 16*5 the calculated transmission and reflection probabilities 
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are plotted. It is evident that the probability of a vibrational transi- 
tion is not large and that it decreases with increasing initial translational 
energy. It thus appears that the interchange of translational and 
vibrational energy has little effect on the transmission coefficient, at 
least as long as the translational energy is large in the final state. 

The effect of curvature of the reaction path on the transmission 
probability may be investigated in the same manner. A convenient 
energy surface to consider is that formed by the channel between two 



Fra. 16-6. Hyperbolic reaction path. 


confocal hyperbolas, Figure 16-6. The potential energy inside the 
channel is taken to be V = 0; outside the channel, V = °o. The 
Schrodinger equation is separable in the elliptic coordinates /» and <p 
given by 

x = R cosh m sin ^ y - R sinh /i cos <p 16-53 

into two equations: the equation in n describes the motion parallel to 
the reaction path, and the equation in <p describes motion perpendicular 
to the reaction path. These equations are 


£U 8* 2 MR 2 
dn 2 h 2 


/E 

( — cosh 2/i 


U 


= 0 


d 2 $ 8ir 2 MR 2 
dv 2+ h 2 



$ = 0 


16-54 

16-55 
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where R is the focal length of the hyperbolas, E is the total energy of the 
system, and X is the separation parameter to be determined by the 
boundary conditions. The lines <p = <po are the hyperbolas 

-4 4 - = & 16-56 

sm cos <pq 


dy 

whose asymptotes have the slope — = cot <p 0j so that is the angle 

ax 


between the asymptote and the y axis. If the angle between the 
asymptotes of the two bounding hyperbolas is denoted by A, the bound- 
ary condition is that $ = 0 on the two hyperbolas whose asymptotes 
A A 

are given by^> = <po + ~ and <p = <po — ~ • 
z z 


Equation 16*55 is now solved for $>, and the allowed values of X are 
determined. When these values are substituted into equation 16*54 
it becomes a one-dimensional equation for the translational motion of 
the system through the channel. For low values of E and for A < 30°, 
the characteristic values for X may be developed in a power series in 
A. 11 Up to terms in A 2 , the allowed values of X are 


X = 


h 2 n 2 
8 MW 2 


2 E o 

COS (po — — cos 2<P0 

z 


16*57 


where W is the width of the channel at its narrowest point. With this 
value of X, equation 16*54 is a one-dimensional equation for motion 
parallel to the reaction path <p 0 . It is not in the usual form, since the 
energy E is multiplied by a function of the coordinates. Making the 
transformation 

X = TJ\/ sinh 2 n + cos 2 <po 

16*58 

s = R I Vsinh 2 m + cos 2 <p Q dp 


equation 16*54 becomes 

d?x . 8tt 2 M 
ds‘ 

where 


2 + “P (* - V„ - F x ) x = 0 


16-59 


- 


h 2 Tf ~ i cosh 2 2y + 2 cosh 2/x cos 2<p 0 
8t 2 MR 2 L (cosh 2 fx + cos 2*> 0 ) 3 


] 


11 R. Longer, Trans. Am. Math. Soc ., 36, 637 (1934). 
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h 2 n 2 cos 2 <pq 

. X ~ 8 MW 2 • \ (cosh 2 m + cos 2#> 0 ) 

The coordinate s is measured along the reaction path with s = 0 at the 
narrowest point of the channel. 

Classically, every system which began the journey through the hyper- 
bolic channel would get through. However, equation 16-59 shows that 
the quantum-mechanical wave packet encounters an effective barrier. 
This barrier may be ascribed to two causes. The largest part of it is F\, 
which arises from the increasing zero-point energy of vibration as the 
channel narrows. In order to get through the neck of the channel, the 
system must have sufficient translational energy initially to overcome 
the increased zero-point energy of vibration in the narrow neck. The 
second part of the effective barrier is much smaller, of the order of 
0.3 kcal. per mole at most. At large distances, the potential F w is 
attractive, but it goes through a maximum at the activated state, adding 
to the effect of F*. It arises from the transformation of coordinates 
made in equation 16-54 and may be considered analogous to the centrifu- 
gal potential which appears in discussions of the motion of a particle 
which is constrained to move in a circular orbit. From the wave view- 
point, the attractive minimum corresponds to a focusing effect similar to 
that observed with light reflected from concave mirrors. In regions 
where F w is low there is a concentration of systems, due to the Boltz- 
mann distribution, similar to the concentration of light intensity at some 
points in front of a curved mirror. 

We may conclude from this example that the effect of curvature of the 
reaction path is to introduce a virtual potential barrier at which there 
will be a reflection probability given by equation 16-44 or 16-45. How- 
ever, it is important to note that for a hyperbolic channel there can be no 
interaction of vibration and translation, since the Schrodinger equation 
is exactly separable in the coordinates corresponding to these motions. 
This, as we have seen, is never completely realized in actual reactions, 
although for many reactions it may be a good approximation. 

The transmission coefficient for a surface tfhere vibrational transitions 
do take place may be calculated for a channel having vertical parallel 
walls which makes a 90° turn. Here, as in dealing with the straight 
parabolic channel, it is necessary to join solutions of the Schrodinger 
equation for each of the three regions shown in Figure 16-7. If the 
width of the channel is l and the origin of coordinates is at the outside 
comer, then, for x > l, 0 < y < l, there is an incident wave moving to 
the left on which is superposed a number of reflected waves moving to 
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the right, so that the wave function is 

2r% 7 oo 2 ir % * 

= e * Ptx sin y y + £ 5 w e 4 sin y y 16-60 

l y-o » 

For 0 <x<l, 0 <y<l, the wave function must satisfy the boundary 
conditions 

Hx,o)=HO,y) = *(U) = o 

Such a function is 



Fig. 16-7. Reaction path with 90° turn. 


In region III, where y > l, 0 < x < l, there are only the outgoing waves 


2 T» • 


’ B in^x 


At x = 1 we must have 


*5(i,v) -irtF&y); 


#*1 


Similarly, at y = I we must have 


tffeO -*?'(*, 0; f] -fl 

dy ly-l ty J„-l 

If we expand sin —7-^ x in terms of sin — x, we can write down the four 
h l 

|i 

equations indicated above and compare coefficients of sin y x. This 

i 
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2 ri f 

— piLRkje 


f . 2irpj 
Rkj = Akj sm — i 

rt , 2iri . ”1 


2 ri . 
. — 


2iri x p#i 

— py7*ye * = 


27r %7rpj , , _ . . N „ 7rri _ 

= — Vi COS -y- J + E(-l) n y fifc„C„y 

2’fcy = Bjty sin ^ Ze-^' p,i 16-65 

/l 

2x 27rpy _ , ,„*n 

Rkj , Vi COS ^ "T ( 1) . A kn C n j 

h n . l 


where 


_ 2 r l 
~ lJ 0 


. 2irp } . irj 
sm — - — x sm — x ax 
h l 


The solution of these equations gives for the matrices of the coefficients 
A k j and B kj the results 

A = — 2t(l - G 2 )- 1 

B - — 2i(l - G 2 )~'G 16-66 

where G is a matrix with elements given by 


2 tt - 2 -p PI ,, 


G k i = 


(— 1)* +J 2ir 2 kj sin — p k ie h 


For a given total energy p k can be calculated for different values of k 
and the matrix elements evaluated. Equations 16*65 then give the 
amplitudes of the reflected and transmitted waves. 

pd 

Inspection of equation 16*67 shows that, if is equal to some 

h 

integer n, there is no transmission. Such energies correspond to reso- 
nant frequencies for region II. The transmission coefficient is conse- 
quently an oscillating function of the energy of the system, as it is for a 
one-dimensional barrier. Figure 16*8 shows the density of transmitted 
systems in the first and second vibrational states if the incident wave is 
in the lowest vibrational state. In contrast to the results for a straight 
channel, it is seen that the probability of a vibrational transition upon 
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turning the comer is quite high. That this is not a peculiarity of the 
sharpness of the corner is evident from the classical consideration of two 
straight channels joined by a circular section, as in Figure 16*9. Let 



Fig. 16-8. Density of transmitted systems for 90° turn. 


the last collision with the walls of the straight section be just at the 
point where the circular section begins. Inspection of the geometry 
of the figure shows that, unless the circular channel has a certain very 
definite length, the component of velocity perpendic- 
ular to the reaction path will be altered in passing 
around the corner. This is equivalent to a change 
in the vibrational quantum state of the system. 

From these results, we are led to the conclusion 
that, though a barrier does not very often cause a vi- , 
brational transition in the reacting system, a bend in » 
the reaction path nearly always induces transitions 
to the highest vibrational state consistent with the 
total energy of the system. A system moving slowly 
in the initial state has less chance of surmounting 
the barrier but more of turning a comer than one 
with a high initial velocity. Actually, it is seldom 
that the sharpest bend in the reaction path occurs just at the activated 
state; usually it occurs just after the system has passed through the 
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activated state and consequently is moving slowly. It thus seems 
unlikely that large reflections due to a bend occur in practice. 

It will be noted that the energy unit in Figure 16*8 is inversely pro- 
portional to the effective mass of the reacting system. Consequently it 
will be different for different isotopic compounds. Thus if 5 2 = 3 for 
the complex H + HC1, it will be 6 for D + HC1. The transmission 
coefficients are markedly different. The extent to which this difference 
is realized in practice depends upon the degree to which the maxima in 
the transmission coefficient average out as the result of the thermal 
energy distribution. For reactions involving light atoms, the zero-point 
energy is so high that averaging over the thermal spread may not affect 
the peaks to any great extent. However, the potential used in calculat- 
ing Figure 16*8 is highly idealized. If the incoming channel is narrower 
than the outgoing the width of the peaks is much diminished. Further- 
more, any smoothing of the sharp corner ought to increase the rapidity 
of oscillation of the transmission coefficient, so that it appears that only 
exceptionally should there be any noticeable experimental result from 
these peaks. In reactions involving isotope separations, the slight differ- 
ences in transmission coefficient of the isotopes may well be important. 
It should also be observed that the classical transmission coefficient 
rapidly becomes equal to the average of the quantum-mechanical trans- 
mission coefficient. For all ordinary cases it thus appears justifiable to 
calculate the transmission coefficients on a classical basis. 

16c. Transition Probability in Non-Adiabatic Reactions. 12, 13 We 
have considered so far only those reactions which proceed on one energy 
surface. Frequently, however, two such surfaces come very close 
together or, in fact, appear to cross each other. Actually, at the point 
of the apparent crossing the system is degenerate, since the two different 
electronic configurations have the same energy. This introduces a 
resonance energy which separates the surfaces slightly so that they never 
actually intersect but only approach each other closely, as has been 
explained in section lid. This state of affairs is typified by a diatomic 
molecule which has both homopolar and ionic states. Figure 16*10 is a 
plot of the energy as a function of the interatomic distance for such a 
molecule. The electronic eigenfunctions are fa and fa. For r > r 0 , 
pi is ionic in nature and fa is homopolar; for r«r 0 , the roles are 
reversed. If the molecule is initially in state fa (homopolar) and the 
tnternuclear distance r increases infinitely slowly, the molecule will 
remain in state fa for r » r 0 . But if r changes with a finite velocity, 
there will be a finite probability that the molecule will change from 

18 C. Zener, Proc . Roy . Soc., London , A137, 696; A140, 660 (1933). 

18 L. Landau, Physik. Z . Sowjetunion, 2, 46 (1932). 
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$2 to as it passes r = r 0 so that its final electronic state will be repre- 
sented by a linear combination 

= Ai(r)^ + A 2 (r)yp 2 16*68 

For convenience in calculating Ai and A 2 , however, we shall express 
and \j/ 2 in terms of two other wave functions <pi and <p 2j defined so that 



(pi is equal to for r r 0 , that is, ionic in this example, but <p x remains 
ionic in character for all r. Similarly, (p 2 is equal to for large r (homo- 
polar) and remains homopolar in character for all r. The energies € X 



and e 2 corresponding to <pi and <p 2 intersect when plotted as functions of 
the internuclear distance r (Figure 16*11). Consequently <p x and <p a 
are not exact eigenfunctions of the complete Hamiltonian for the 
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system, and their eigenvalues ei and e 2 are only approximate; actually 
they are eigenvalues for the Hamiltonian which does not include the 
interaction energy € i2 between the two states <p i and <p 2 • At the point 
of crossing we have E i = *i — e i2 and E 2 = e 2 + e 12 , where e\ 2 is the 
difference between the exact eigenvalues Ei and E 2 and the approximate 
eigenvalues ei and e 2 . If we choose <p\ y <p 2) to be normalized and 

orthogonal we must have 

^1 = ^ (<Pl + <P2) 

Then, since 

H^i = E\\p\\ == E 2 \f/ 2 16*70 

we readily see that 

H^i = ei<pi — e 12 <p 2 

16*71 

H<p 2 = e 2 <p 2 — e 12 <pi 

We now consider the state of the system in terms of <pi and <p 2 as it 
passes through r = r 0 . Suppose that initially it is in the state \p 2 . The 
final state will be given by equation 16*68. But this can also be ex- 
pressed in terms of <p± and (p 2 , if we consider the state of the system to vary 
with the time rather than with the internuclear distance. <pi and <p 2 
are not functions of t ; they do not, however, represent steady states, 
since systems are jumping from (pi to <p 2 at the crossing. We must 
therefore write for the final state of the system the expression 

2 *i' t m . 

= ci (t)e h * l <px(r) + c 2 (t)e h n <p 2 (r) 16*72 

At any time t 0 , there are |ci(fo)| 2 systems in the state <pi and |c 2 (£ 0 )| 2 m 
the state <p 2 . It is thus necessary to consider the time rate of change of 
Ci and c 2 at any convenient fixed position r x . Since we know the rela- 
tion of <pi to i and cp 2 to ^ 2 for r » ro, it is convenient to take at oo . 
We now study the time variation of \p given by 16*72, using the time- 
dependent Schrodinger equation: 

( h |9\ f 2W 2 ri 

H- — — J|ci(<)e A 11 * (r) + c 2 (t)e * % 2 (r) 

Substituting equation 16-71, satisfied by <pi and <p 2 , we have 
h dci 


fa = V2 ^' Pl ~ ^ 


16-69 


= 0 16-73 
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These equations must be solved simultaneously subject to the boundary 
conditions 

Ci ( — oo) = 0 |c 2 ( — °o)| = 1 16*75 

which correspond to our knowledge that initially, t = — oo , for r » r<>, 
the system is in the state <p 2) equivalent to \p 2 . The probability of a non- 
adiabatic transition is |Ai(r »ro)| 2 in equation 16*68. Denoting this 
by P, we have 

P = |Ai(r»r 0 )| 2 = |c 2 ( 00 )| 2 = 1 — |ci(oo)| 2 16*76 


Equations 16*74 need be solved only for their asymptotic values of Ci 
and c 2 . Eliminating c 2 from 16*74 gives 


d 2 Ci 

dt 2 + 


2 iri 

T 


(«1 - «2) 

«12 



16*77 


We now simplify this equation by means of the following assumptions. 

(a) €i 2 (ro) the relative kinetic energy of the two systems. When 
this is true, the motion of the centers of gravity of the two atoms is so 
fast that this motion will have a negligible effect on e i2 during a crossing, 

( b ) The transition region is so small that we may regard ei — c 2 
as a linear function of the time, and « 12 , <pi, and <p 2 as independent of the 
time. This is true if e 12 is sufficiently small. 

We therefore write 

2 7 T 

y (*1 - € 2 ) = at 

a a a 16*78 

P€j 2 _ d(pi d(p 2 _ 

dt *" dt ~ dt 


This means that ei (r) and e 2 (r) are the asymptotes to the curves Ei (r) 
and E 2 (r) in the region of the crossing. The smallest separation of Ei 
and E 2 is Ei (r 0 ) — E 2 (r 0 ) = 2e 12 (r 0 ). 

The assumptions 16*78, together with the substitutions 


2tT€i2 

h ; 


Ci = e 




Ux 


16*79 


reduce equation 16*77 to the form 

d 2 U i 
dt 2 


+ 


(r- f +f ?) 


c/i 


16-80 


The solution of this differential equation, subject to the boundary con- 
ditions 16-75, gives for the asymptotic value of |cj ( » ) | 2 the result 

| cx (°°)| 2 = 1 - e~ 2Ty 


16-81 
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where 


7 = 


2 7T 


€12 




Therefore 

P = e~ 2lry 

The denominator in 16-82 can be expressed as 


dt 


dr . 


(«1 € 2 ) 


= y 


^€i $€ 2 

dr dr 


= F|si - s 2 


16-85 


16-8£ 


16-84 


where T = ~ is the velocity with which a system crosses r = r 0 anc 
dt 

|ai — ^2 | is the difference of the slopes of the two crossing potential 
surfaces at r = r 0 . We have 

P = 6 16-85 

for the probability of a transition, that is, of “ non-adiabatic ” behavior 
The probability that a system stay on the initial energy surface is ther 


4* 2 «12 

p' s- i — e 


16*8f 


16d. Thermodynamics of Reaction Rates and the Effect of Applied 
External Forces. Returning to equation 16-17, we may formally 
write it 


F x 


,, kT 

K — K — 1 

h FiF 2 


Vo 
kT . 


h 


16-87 


where K* is the constant for the equilibrium between the activated 
complex and the reactant molecules. Using the thermodynamic relation 
— A F — RT log K, we may write 16-87 as 

kT 

k' = K — e RT 16-88 

h 


where A F* is the difference in free energy between the activated and 
normal states and is called the “ free energy of activation.” Since 
A F = AH — TAS, we also have 

kT 

k' — K — - e ~ RT g R 


16-89 
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Equations 16-88 and 16*89 have proved to be very useful in the interpre- 
tation and correlation of experimental data. 2, 14, 15 

Equation 16*88 may readily be extended to include the effect of an 
applied external force. Let A F* be the free energy of activation for a 
given process in the absence of the external force. Then if an applied 
force has a component / along the reaction path, and if this force acts 
x 

through a distance - from the normal to the activated state, the free 

/x 

energy of activation will be decreased by an amount — . The specific 
forward rate in the presence of the external force will thus be 


, kT - 

kf = K—e 
h 


fx 

k 0 e 2kT 


where ko is the rate in the absence of the external force and N is 
Avogadro’s number. 

In the same way there is a backward rate against the field of 

fx 

k b = koe 2kT 16*91 

This gives a net rate of 

/ JfL. _Al\ fx 

k f = kf — k b = k 0 \e 2kT — e 2kT ) = 2 k 0 sinh^~ 16*92 

and hence a net forward velocity of 


\k r = 2A&o sinh 


Here X is the average distance traveled per jump, which may, or may not, 
equal x. 16 

Elaborations of 16*92 and 16*93 have been successfully applied to the 
study of a wide variety of physical problems, including viscosity, diffu- 
sion, plastic deformation, and electrochemical phenomena. 2 

14 M. G. Evans and M. Polanyi, Tram. Faraday Soc., 31, 876 (1935). 

16 W. F. K. Wynne-Jones and H. Eyring, J. Chem. Phys 3, 493 (1935). 

16 H. Eyring, J. Chem . Phys., 4, 283 (1936). 



CHAPTER XVII 

ELECTRIC AND MAGNETIC PHENOMENA* 

17a. Moments Induced by an Electromagnetic Field. In order to 
discuss the phenomenon of optical rotatory power, as well as certain 
other phenomena, we need expressions for the electric and magnetic 
moments induced in an atomic or molecular system by an electromag- 
netic field. The procedure to be followed in calculating these moments 
will be quite similar to that in Chapter VIII in the discussion of radiation 
theory. 

The wave functions for the unperturbed molecule will be of the form 

L^t the normal state of the molecule be ^ Then, in the presence of an 
electromagnetic field the wave function for the molecule may be written 


as 

* = + E*(0*S 

17-1 


b 


where 

<*Mh'W) 


and 

H ' = A'-p' 



In equation 17*1 the assumption has been made that the coefficients of 
all wave functions except are very small. 

In developing the principles of radiation theory we made the approxi- 
mation that A could be regarded as constant over the molecule; this 
assumption gave the result that the probability of a transition between 
two states was proportional to the matrix element for the electric dipole 
moment between the two states. For the discussion of optical rotatory 
power this approximation is insufficient. The value of A at any point 
in the molecule may be expressed by a Taylor’s series expansion in terms 

* See General References. 
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of the value of A and its derivatives at the origin of a coordinate system 
fixed in the molecule. For the x component of A at the position occupied 
by the jth particle in the molecule we will have, neglecting higher terms: 


To this approximation the perturbation H' may be written as 


17-2 


H' = - -Z— 
c j rtij 


Terms 

A . V + * (v . A ,„. ( rV ) + — ? 

moment 


17-3 


The first and second terms in H' include the operators for the electric 
and magnetic dipole moments, respectively. The electric quadripole 
moment will be of no importance for our purposes ; hence this term in H' 
will be disregarded. The matrix element (’4 , 5*|H / )! r 2) is therefore 


(«j*ih'|«®) - p y k) • A ° 

lc \ | j nij | / 

or, since 

(*SVI«2) - m,-| (*2*M*2) 

(*S*|H'|*2) = - (i>|R|a) • A 0 + (b\U\a) ■ (V-A) 0 J 


174 


17-5 


where 


E ba = E b - E a 


(6|R|a) = J *° b *(Zer 3 WadT 

(6|M|a) = f 4?* {Z (r dr 


We now introduce explicitly the time dependence of A 0 as 


A 0 = JA8(4‘ + e 4 


17-6 
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where t — hv and v is the frequency of the electromagnetic field which is 

deb 

perturbing the molecule. The coefficients — are thus 

at 

¥ = (6lR|o) * ^ + (blM|a) * (VxAS) } x 

(e 1 * + e * / 17-7 

Integrating with respect to time, we have 


C6 = ^ 0\R\a) • Ag + (6|M|o) • (V*Ag)l X 


.Eba-^-t A .Eba—* A 

e~*~ e nr‘i 

Eba 4 " € Ebct ““ € 


+ Constant 17-8 


The constant in this expression for c*> is not a function of the time- 
dependent perturbing field and may for convenience be put equal to 
zero in this problem. 

According to section 8g, the electric dipole moment of the system 
represented by the wave function is 

R = Re (¥*|R|*) 

Neglecting terms of the order of c 2 , this is equal to 

R 0 = Re j(a|R|a) + 2£c&(a|R|b)e * * * 17-9 

The first term in 17*9 represents the permanent dipole moment jjl of the 
unperturbed molecule; the second term represents the dipole moment 
R^ which has been induced by the perturbing field. Substituting 17*8 
into 17*9, the expression for the induced dipole moment becomes 

K = Re {[e ^ (a|R|6)(fe|R|a) • Ag 

+ (a|R|6) (6|M|a) • (V>< A °)]^^- + 17-10 

or, since 

Eta + eEta-e El - e 2 
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K = Re 


+ 


-5(^4*-^)] 

E(o|R|6)(6|M|o) • [ ^r <2) WO (*'*'+ e 

~«^ (v 4*- r *)]} 1711 

From the relations E=~-~A, H = V x A between the fields E and H 

c at 

and the vector potential A, we have 

(e'** 4 - e *#0 

17*12 


dE° 

€ 2 o ( « 

-t — 1 

dt 

= 2 

* + e 

H° 

= i(vxAg) C 

*!< , - 
i * + e 

dH° 

U , n 

( *s ( 

dt 

- 5 

Ve * — 


Using these relations, equation 17*11 reduces to 

K = 2Re\L JgTZTj) (olR]6)(&|Rl«) - E° 

. _ ihEle . . dE° 

+ ?( «£-. V (,l|R|t)(tlR|,,) "5~ 

+ E ,/*“ , («|R|»)(»1M|«) • H° 

b &ba * 

+ L r~ 2 («|R|6) (6|M|o) • ~1 17-13 

6 &ba ~ * 0* J 

We will be interested in the components of in the direction of the 
fields. To obtain these components, we average the expressions of the 
type RR • E over all orientations of the molecule with respect to the field 
directions, assuming all orientations to be equally probable, and obtain, 
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as in Chapter VIII, the result -J-R • RE. Equation 17-13 thus becomes 

(o|Rjb) • (b|R|a)E° 
d E° 


Ra = 

+ £ 


Eba 

(El - e 2 ) 


ihE, 


ba 


(F 2 ,, 2 (a|R|b) • (6 |R|o) 

b (E ba ~ « 


dt 


+ £ 
b 

+ £ 


Eba 


(a|R|6) • (6|M|o)H° 

JXX° 

T\ (°I R I&) * (6|M|o) — 

6 K^ba — t ) dt 


b (El - * 2 ) 

ih 

l Ka 


17-14 


The induced magnetic moment M„ is obtained from 17-14 by replacing 
(a R| b) - (b|R|a) by (a|M|b) • (&|R|a) and (o|R|6) • (b|M|a) by 
(a M|6) • (&|M|a), as is apparent from the derivation. In taking the 
real part of 17-14, we see that, since 

(a|R|6) • (6|R|o) = |(a|R|b)| 2 

dE° 

is real, the coefficient of — — is purely imaginary, so that its real part is 

at 

, dH° . 

zero. Similarly, for M', the coefficient of — — is zero. 

dt 

(a|R|6) • (6|Mja) can be complex, so that 

Re{i(a\R\b) • (6|M|a)} - -Im{ (a|R|&) • (b|M|a)} 

where Im means that we are to take the imaginary part of the term in 

d!E° 

brackets. In calculating M^, the coefficient of — - can be further 

dt 

reduced, since 


jp2 

** ^ ba 
in —7> 

y 1 

b (El - e 2 ) 

ih 

b 


(a|M|b) • (b\R\a) 


= - 5 £(a|M|6)-(6|R|a) + £— r 

'■ * b \&ba 


ih 


* 2 ) 


(o|M|b) • (b|R|o) 


ih 


« -2 (a\M - R|a) + £ 


ih 


7 (El - e 2 ) 


(a|M|b) • (6|R|o) 


The first term is purely imaginary, so that its real part is zero. We 
note further that, owing to the Hermitian character of R and M, 

[(o|M|b) • (b|R|o)l = [(«|R|b) • (b\HL\a)}* 
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so that 

7m[(a|M|6) • (6|R|o)] = -Im[(a|R|6) • (b|Mja)J 

Making use of the various relations given above, we have the final 
expressions for the induced moments: 

R£ = «X + y«H / --| 3 
v c at 

M' = k 0 H / + 7oE' + - ft, a ~ 17-15 

c at 

The superscripts on the fields have been dropped, but the fields have 
been written as E', etc., to denote the fact that they are the total effec- 
tive field at the molecule, which may not be the same as the external 
applied field. The quantities a', K a , y a , @ a are, as may be seen from 17 *14 

, _ 2 Pj, a | (a[R|6)| 2 
3 h t vl ~ v 2 

2 Pb»|(q|M|fr)l 2 

3 h » V 2 a - V? 17 . 16 

2 ^ vfcJSej (a|R|6) • (&|M|a)} 

3*t v\a~? 

c /m{(a|R|b) ♦ (b|M|a){ 

Pa 0 , 2^ 2 2 

Oirh h v b a — v z 

, Eba , f 

where Vb a = -r~ and v = - • 
a A 

17b. Dipole Moments and Dielectric Constant As shown in 
Chapter VIII, the matrix element (a|M|&) for the magnetic moment will 
in general be veiy much smaller than the corresponding matrix element 
(a|R|6) for the electric moment. Of the quantities a , k, f}, y listed 
above, k, 0, y will thus be much smaller than a, so that, to a first approxi- 
mation, and one which is sufficient for many purposes, we may state 
that the effect of an electromagnetic field on a molecule which is in the 
state a is merely the induction of an electric dipole moment 

R£ = a'E' 17-17 

where a' a is the polarizability of the molecule in the state a. If the matrix 
element (a|R[a) is different from zero, that is, if the molecule possesses a 
permanent dipole moment fi a , there will be an additional contribution to 
the polarizability arising from a partial orientation of the molecules with 
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their dipole moments pointing along the field direction. We may calcu- 
late this contribution to the polarizability in the following manner. 

Let us consider that the molecule is subjected, not to the influence of 
an electromagnetic field similar to that associated with light, but to a 
constant electric field in the z direction, of strength E x . For this case, 
the vector potential A may be taken equal to zero ; the scalar potential <p 
is — zE z , The classical Hamiltonian function for a system of charged 
particles will then be 

H - H 0 - HeiZiE f 17-18 

* 

where Ho is the Hamiltonian function for the system in the absence 
of an external field. For this system, the dipole moment is 

dH 

M* = I >.2,- = - 17-19 


Quantum mechanically, the relation analogous to 17-19 will be 

_ d € <* 

Mo ‘ dE 2 


17-20 


where e a is the energy of the molecule in the state a, this energy being 
expressed as a function of the perturbing field. 

The quantum-mechanical method of calculating the contribution to 
the polarizability arising from the orientation of the permanent dipole 
moment of a molecule may be illustrated by the simple example of a dia- 
tomic molecule with a permanent moment along the internuclear 
axis; this calculation, in fact, leads to results of quite general validity. 

We treat the diatomic molecule as a rigid rotator in space; the wave 
functions, in the absence of a perturbing field, are 

ij, M (1 9 , <p) = P l f l (cos d)e iMv 17-21 


and the associated energy levels are 

J(J + 1 )h? 
e ° 8 tt 2 7 


17-22 


The perturbation is —hqE* cos 6, so that the first-order perturbation 
energy is 


e = W'jr, m\ —HoEt cos e\ypj, m) — 0 17*23 


as may readily be seen from the known properties of the wave functions 
i>j, u. In order to evaluate the second-order perturbation energy we 
need the values of the integrals 

Hj, Mi M’ = i'pj, m\ —/to-®* COS M') 


17-24 
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These integrals are similar to those involved in the determination of the 
selection rules for electronic transitions in the hydrogen atom, and are 
zero unless M = M' , J = j' ± 1. From the recursion formulas 4-85, 
we find for the squares of the pertfnent integrals : 


| Hj, M ; /— 1, m\ 2 
\Hj, M; J+l, m\ 2 


= M 

= Mo®* 


\ (J+M)(J-M) j 
l(2J + l)(2J- l)j 
f (J + M + l)(J-M + l) l 
I (2/ + l)(2/ + 3) J 


17-26 


•The second-order perturbation energy will be 


// . \Hj, M; J+l, m\ 2 

*J,M = 1 

ej — tj_ i tj — ej + 1 


For J = 0, M = 0, we have 

For J 9^ 0, we obtain 


„ _ 8 tt 2 / &E 2 

e oo — ~ 


jf 

e J , M 


SjWo Ef 

2 h 2 


2h 2 3 

3M 2 - J(J + 1) 


J(J + 1)(2 J - 1)(2J + 3) 


17-26 


17-27 


17-28 


According to 17-20, the z component of the dipole moment when the 

q n 

molecule is in the state = . Since the energy 

dE z 

of a state, to a first approximation, depends only on J, it will be con- 
venient to calculate 

M-+J 

/*/, z — L 17-29 

at—/ 

Now 


J (J + 1)(2«7 + 1) 


M—J 

£ M 2 = 2(1 2 + 2 2 + 3 2 + • • • + J 2 ) = 

M~-J 6 

so that 

U E [(3M 2 - J(J + 1)] = J(J + 1)(2J + 1 ) 

M--J 

- (2J + l)(/)(/ + l) =0 

We thus see that vj, * = 0 for J 0. For J = 0, we have 

8ir 2 lrfE,' 


17-30 


Pa, x 


Bh 2 


The average z component of the dipole moment is thus just mo, * multi- 
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plied by the probability that the molecule will be in the state with 
/ = 0. According to 15-26, this probability is equal to 

No _ 1 

N - — 

E« kT 

i 

where is the energy of the ith state measured from the state with 
J = 0. If we neglect the second-order perturbation energy, we obtain, 
according to 15*59, the result 

„ SrlkT 

so that the contribution to the polarizability arising from this effect is 
— — . This result is identical with that obtained from the well-known 

OKI 

2 

Mo 

classical theory. It is to be noted that the final result —— is valid only 

OrCl 

for low field strengths (neglect of e [ 9 in c*), and high temperatures 
(replacement of summation by integration), but these conditions are 
adequately fulfilled in the usual measurements. Although this result 
has been here derived only for a diatomic molecule with a permanent 
dipole moment, a similar treatment shows that the result obtained is of 
general validity. 

The above contribution to the polarizability arises from the partial 
orientation of the molecules with their permanent dipoles along the field 
direction. In deriving the expression for a ai we assumed all orientations 
to be equally probable. The effect on a a of this partial orientation is of a 
higher order than the effects considered above and will be neglected. 
Our above derivation assumed the field to be stationary. If this is not 
true, the results must be modified. The nature of the necessary modifi- 
cation may be seen as follows. If the frequency of the field is much less 
than the frequencies associated with molecular rotation the molecule 
will be oriented in the manner discussed above; if the frequency of the 
field is much greater than the rotational frequencies the molecule will 
not be oriented with respect to the field, and there will be no contribution 
to the polarizability arising from this effect. Since molecular rotational 
frequencies correspond to wavelengths in the far infra-red, there will be 
no molecular orientation when the perturbing field has the frequency of 
visible light. We thus have the final formula for the dipole moment 
induced by an electric field 

R a « a a E' 
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with 


where 


<*o 


Mo 


+ a a (stationary fields) 


3kT 

a a = a' a (visible light) 




n a = |(o|R|a)|; «o = ^L— 2 

oh b Vba v 

To complete the discussion of the relation between dipole moment and 
dielectric constant, we consider a material medium containing N\ 
molecules per cubic centimeter, all of the same kind and in the same state 
a (the extension to mixtures is obvious ; if more than one state need be 
considered, it must be weighted with the appropriate Boltzmann factor). 
The polarization of the medium will be P = iViR. For an isotropic 
medium, we may use for the effective electric field the Lorentz field 

E' = E + yP 

If E' is expressed by this relation, the polarization of the medium will be 

N“ i<xK 


P = 


1 - 


4:irNia 


From electromagnetic theory, we have the relations 

D = eE = E + 4 ttP 

where e is the dielectric constant of the medium and D is the electric 
displacement vector. In terms of the polarizability, the dielectric 
constant will be 

AirNia 


€= 1 + 


1 - 


AirNia 


or 


6 ~ 1 


3 

47rA^ia 


e + 2 3 

For stationary or almost stationary fields, we have 


e z 1 4lttN 

€ -f* 2 


4^/m 2 , A 

3 \3 kT + a ) 


17-31 


17-32 
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which is the relation used when dipole moments are determined from 
measurements of dielectric constants. The usefulness of a knowledge of 
the dipole moment in investigating the structure of a given molecule is so 
well known as to require no further comment at this point. 

17c, The Theory of Optical Rotatory Power. In order to under- 
stand the origin of the phenomenon of optical rotatory power, we must 
include the higher-order terms when calculating the moments induced 
by an electromagnetic field. We therefore write, for the fields associ- 
ated with visible light, 

/ 4tt \ 3 d 

R = a[E + — P) - “H 17*33 

\ 3 / cdt 

™-‘ s +-M e+ ? p ) i7 - 34 


where a = 5ZPa«a> & = Jlpafia, k = X)p a K a , and pa is the probability that 

a a a 

the molecule will be in the state a. For the effective electric field we have 

47T 

again used the Lorentz field E' = E + — P; for the effective magnetic 

o 

field we have set H' = H, since the magnetization is in general very 
small. The terms in y a have been neglected, since it may be shown that 
their inclusion would have only a second-order effect on the optical 
rotatory power. 1 The electric induction D and the magnetic induc- 
tion B may, with the aid of 17*33 and 17*34, be calculated to be 


D = E + iwNiR 

D = eE — g^-H 
at 

B * H + 4jrA r 1 M 


' 3 + 8t#i« \ 
s 3 — 4 tNi<x) 


\2irN x - \ , 

«_ i H 

3 — 47rA r i a ] dt 


17-35 


12rNi — 

"* (1 -f- 4TiST 1 /t)H + ■- ; T C - — E (to the first order in jS) 

3 — 4:irNia dt 

= KH + E 17-36 


1 E. U. Condon, Rev . Modem Phys . 9, 444 1937. 
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where 

€ — 1 47riV\a 

r+2 = 3 

K = 1+ 4 tN iK 

g = 17 . 37 

C o 


The nature of the electromagnetic field in a region free from real charges 
or real currents is specified by Maxwell’s equations 


V-D = 0 


V*E 


- — -B 
c dt 


V -B = 0 


V*H = - -D 

c dt 


17-38 


For any particular medium, these equations must be solved subject to 
relations of the type 17*35 and 17*36. Let us suppose, for the moment, 
that the parameter g is zero, so that D = eE,B = KH. From the equa- 
tion for V x E, we obtain, by taking the curl of both sides, 

1 d Kc d 2 

V*V*E = — V 2 E + VV • E (V X B) r-^E 

cdt c 2 dt 2 


or, since V • E = 0, 


2-e* Ke d 2 „ 
V 2 E = — -jE 
c 2 dt 2 


17-39 


In the same manner, we find 


V 2 H = 


Ke 

C 2 dt 2 


H 


17-40 


These equations represent waves propagated with a velocity 

c c 


where n is the index of refraction. In non-magnetic media K S 1, so 
that n 2 = e, which is the familiar relation connecting index of refraction 
with dielectric constant. We will now investigate the solutions of 
Maxwell’s equations for the case where g is not zero. 

For our present purposes, we will investigate the nature of the solu- 
tions representing plane waves propagated along the z axis. The equa^ 
tion V • D = 0 may be written as 
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dD dD 

For our plan© wave, — - = — - = 0, and so we conclude that D t = 0. 
dxdy 

In the same way, the equation V *B = 0 requires that B z = 0; since 
D z = B z = 0 we must have E z = H z = 0. We now specialize our 
desired solution even more; we require that the solution represent right 
circularly polarized light. For this type of wave we have 

E = E(i cos \f/ — j sin 1742 

( nz\ 

t — —I is the phase of the 

wave of frequency v propagated with a velocity v = - . At t = 0, 

* = 0, E is directed along the x axis. As t increases, E rotates toward the 
— y axis, or, to an observer stationed so that the light enters his eyes, 
E rotates clockwise. From the curl equations, we have 


V*E = — i 
2i rvn 


dE„ 


^ .dE x 


1 d 


E{ — i cos \p + j sin \p} B 

C C dt 


or 


B = nE{ i sin ^ + j cos \p} 

In 17*36 we make the approximation that K = 1. Then 

_ _ , dE __ 2irvg 

B=H+ ? — = H -— j y -B 
at n 


1743 


or 


H = B - (» + 2™g)E{i sin * + j cos *) 17-44 


From the curl equations, we have 

2wn 


1 dD 

V*H *= (n -f 2wg)E (— i sin t — j cos \f/) = - — 

c c at 


or 


D -» n(n + 2irvg)E( i cos ^ — j sin = n(n + 2rvg)E 17-45 
But, according to 17-35 and 17-44 

D 
or 




d 


n 


nt 


n + 2rvg 


E 


1746 
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In order that 17-45 and 17-46 be consistent, we must have 


fit 

n(n + 2irvg) - 

n + 27 rvg 
or 

n R = — 2irvg 

17-47 

where n R is the index of refraction for right circularly polarized light. 
For left circularly polarized light, we may write E as 

E = E(i cos 4/ + j sin 

17-48 

Proceeding in exactly the same manner as above, we 
case the index of refraction is 

find that in this 

n L = e** + 27 xvg 

17-49 

We therefore write 


+ 8, ~ t o — 8 

17-50 


where = 2wp 


( nz\ . 

‘-t ) 1 


is the phase of a wave propagated with the 


mean index of refraction n = e H and 5 = 47 v 2 v 2 g - . The superposition 

c 

of right and left circularly polarized light waves of equal amplitude 
gives a plane polarized wave. Adding 17-42 and 17-48, we have 

£ = 2?{i[cos (^o + $) + cos (\bo — 5)] 

+ j [- sin (^o + 5) + sin (^ 0 - 8)]} 

= 2 E cos ^ 0 {i cos 8 — j sin 5} 17*51 


For 8 = 0, E is along the x axis ; for 5 > 0, E has been rotated through an 
angle 8 in the clockwise direction as viewed by an observer looking along 
the — z axis. A medium for which 8 is positive is therefore dextro- 
rotatory. The rotation in radians per centimeter is thus 

&ir 2 v 2 g 

* z c 


or, according to 17-37 and 17-16, 

f 47 r 2 V 2 4:irNl € + 2 

* = ~ 7 ~ — — 


16tt 2 %t (n 2 + 2\ _ _ v 2 R ba 
* (' — 5 1 £paE ~2 ^2 17-52 

ich \ 3 / a b V ba — V* 

where R ba = Im{(a\R\b) • (6|M|a)J is the “rotatory strength” of the 
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transition a — » b. From the theoretical viewpoint, the calculation of the 
optical rotatory power of a molecule is thus reduced to a calculation of 
the matrix elements (a| R|fo) and (b|M|a). Since the eigenfunctions of 
complex molecules are not known to any high degree of accuracy, it has 
not as yet proved possible to determine the absolute configuration of any 
molecules by actual calculation. For a discussion of the attempts 
which have been made in this direction, as well as of related topics from 
this viewpoint, the reader should turn to one of the review articles in 
this field. It is possible, however, by a study of the symmetry proper- 
ties of the rotatory strengths, to state the conditions necessary for optical 
activity. 

The operators R and M in rectangular coordinates are 
R = e{ix + j y + kz\ 



If the molecule has a center of symmetry we may classify the states of 
the molecule as odd or even according as the wave function for a given 
state changes sign or retains the same sign when subjected to an inver- 
sion at the center of symmetry, that is, when each coordinate is replaced 
by its negative. Since the operator R changes sign upon inversion, we 
have a non-vanishing value of (a|R|b) only between odd and even states. 
The operator M does not change sign upon inversion; hence we have a 
non-vanishing value of (o|M|b) only between two odd or two even states. 
The scalar product (a|R|b)-(b|M|a) will therefore be identically zero for 
all states a and b, and the optical rotatory power will be zero. If the 
molecule has a plane of symmetry, we may again classify the wave func- 
tions as odd or even with respect to reflection in this plane. Let the 
plane of symmetry be the yz plane. Then, with respect to reflection in 
this plane, the x component of R is odd and the x component of M is 
even. There will be no pair of states a and b for which the x components 
of (a|R|b) and (b|M|o) are both different from zero. The same is true 
of the y and z components. A fundamental requirement for optical 
activity is thus that the molecule possess neither a plane nor a center of 
symmetry. 

We now consider the values of (a|Rjb) • (b|Mja) for a given molecule 
and its mirror image. A molecule may be transformed into its mirror 
image by reflection of its coordinates in any plane; this reflection is 
equivalent to changing from a right-handed to a left-handed coor- 
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dinate system. If we reflect a molecule in the xy plane, the new 
value of (o|R|b) • (6|M|o) for the molecule is obtained by replacing 
z by — z in the wave functions a and b in the corresponding expression 
for the original molecule. If we change z to —z in both eigenfunctions 
and operators, the product of the matrix components will remain un- 
changed, since the values of the integrals involved, being pure numbers, 
are independent of the particular coordinate system in which they are 
evaluated. From the form of the operators, we see that R • M changes 
sign when z is replaced by —z; therefore (o|R|b) • (&|M|a) must change 
sign if we replace z by — z in the eigenfunctions only. An unsymmetrical 
molecule and its mirror image must therefore have equal optical rota- 
tions, but with opposite signs. If a molecule is identical with its mirror 
image, its optical rotation must vanish. These conditions for the exist- 
ence of optical activity are, of course, identical with those which have 
long been known. 

17d. Diamagnetism and Paramagnetism. According to equa- 
tion 8*15 the Hamiltonian operator for a system of charged particles in 
an electromagnetic field is 

H = H 0 + H' 


where H 0 is the Hamiltonian operator for the system in the absence of 
the field and 

H ' = L (ih ~ c Vi • A + 2ih e -A, • Vi + e - 2 | A.-I 2 ) + e, Vi J 17-54 

Let us consider a uniform magnetic field of magnitude H t along the 
z axis. The potentials are 

A-Xf = 2^ z Vi\ Ay i “ A t — 0 ) tfi — 0 


From these values of the potentials we readily find the relations 

H 2 , 


Vi • A, = 0 


A, 


(*, ? + yi) 


LA, • v,- = 



H z 2iri 

T T 


JVL 


17-55 


where M, is the operator for the z component of the angular momentum 
of the system. The perturbation H' can thus be written 


H' = 


2mc 


HM,+ 


8 mc‘ 


£(*? + »?) 


17-56 


If we include the interaction of the magnetic field with the electron spin, 
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the perturbation H' becomes, according to 9*11, 

h ' - - + 2s -> + s m ™ 

The term in Hi will in general be much smaller than that in H z and will 
be of importance only where the eigenvalues of M z and S* are zero. We 
will discuss the term in Hi , which is responsible for diamagnetism, later, 
and will for the present concentrate our attention on the term in H tf 
which is responsible for paramagnetism. 

For atoms or ions, a portion of our discussion will parallel the discus- 
sion of the Zeeman effect in Chapter IX. The paramagnetic term, for 
an arbitrary magnetic field H, is 

H' = — r— {H • M + 2H ■ S} 17-58 


For an atom in the state characterized by quantum numbers L and S, 
the first-order perturbation energy will be 


E\L, S) = - — — — {H-L* + 2H-S*} 
iirmc 


where L* is a vector of magnitude VL(I + 1) and S* is a vector of 
magnitude VS(S + 1) The magnetic moment for this state will be 
given by the relation analogous to 17-20, that is, 

../r cs dE'(L,S ) eh (T . , 


I S ) = 


{L* + 2S*| 


= (L* + 2S*)*i 0 17-60 

ch 

where mo = *: is the Bohr magneton.” The component of u(L, S) 

4:irmc 

along the vector J* representing the total angular momentum will be 
\lj(L, S ) - mo{L* cos (L*, D + 2S* cos (S*, J*)} 

= gJ*no 17-01 

where 

, ,/(/ + !) + S(S + 1) - L(L + 1) 

" _1+ wTi) 

In the presence of a magnetic field, the atom is so oriented that the pro- 
jection of J* along the axis of the field has the quantized values M = J, 
J — l • • • — J. The average value of the component of the magnetic 
moment of a given atom along the direction of the field, provided that 
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only the ground electronic state has an appreciable chance of being 
occupied, is 

qMuqH 

EffM/* 0 e kT 

P ” qMimH 17-62 

Ee kT 

u 


For small values of H, we may expand the exponential and retain only 
the first two terms. Then 


^oEM+^em 2 

m kT M 

£(■ 


17-63 


Since 

El = (2 J + 1), 

M 


EM = o, em 2 = £S£±VJ ? £±±l 

MM 3 


equation 17*63 reduces to 


A - 


jv_ + 

3kT 


H 


17-64 


The contribution to the magnetic susceptibility arising from the pres- 
ence of permanent magnetic dipoles in an atom or ion is thus 


J(J ± 

ZkT 


an expression which is very similar to the contribution 


to the electric polarizability arising from the presence of the permanent 
electric dipole. This paramagnetic term vanishes for atoms with 
J — 0; for atoms in S states we have J = S, and the paramagnetism 
arises entirely from the unpaired spins. Most molecules have zero 
orbital angular momentum and no unpaired spins in their ground states, 
and thus show no paramagnetic effects. A notable exception is 0 2 , 
which has the ground state 3 27, and thus exhibits a paramagnetic 
effect arising from the unpaired spins. The exact value of the para- 
magnetic susceptibility for molecules depends upon the nature of the 
coupling between the spin, orbital, and rotational angular momenta. 

All atoms or molecules, regardless of whether or not they are para- 
magnetic, show diamagnetic effects. The perturbation energy arising 
from the second term in 17-56 is 


E' 


e 2 ff 2 
8 me 2 


Z (afi+tl/?) 


17*65 
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where (x 2 + Vi) is the average value of (x* + y\). Since, to a first 
approximation, all directions in space will be equivalent, xj = j / 2 = = 

where r 2 is the distance of the electron from the center of gravity 
of the system, so that 


E' = 


e a fl 2 2 -5 
8 me 2 3 i r< 


17-66 


The associated magnetic moment will be 


>HH r i> and the corre- 


sponding susceptibility is — 5 which we note to be negative. 

bmc i 

The complete magnetic susceptibility for an atom or ion will thus be 


e 2 ? J(J + 1 )g 2 /4 

6 me 2 i ’ 3 kT 


17-67 
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19a. Van der Waals’ Forces. The weak attractive forces between 
atoms which are not connected by ordinary valence bonds have long 
been known by the name of Van der Waals’ forces. If the particles 
possess permanent dipole moments, certain forces will arise from this 
cause; these forces can be calculated from classical theory. Van der 
Waals’ forces exist even if the particles are symmetrical; the origin of 
the forces may be most simply seen by consideration of the interaction, 
of two hydrogen atoms when the interatomic distance is large. 

For the hydrogen molecule, the wave function, in the Heitler-London 
approximation, may be written as 

4' = Ml)M2)+MVMV 18-1 

If the intemuclear distance is sufficiently large, we may assume the 
electrons to be definitely located on one or the other hydrogen atom, and 
we may write the wave function for this system as 

* = iMl)&(2) 18-2 

The first-order perturbation energy will then be 

E' = J ^(l)^(2)HV a (1)^(2) d n dr 2 18-3 

where 



the symbols — , etc., having their usual significance. 
T a b 



If, in the coordinate system as illustrated in Figure 18-1, we let 
(#i, j/i, «i) be the coordinates of electron (1) relative to nucleus o, and 
(*a, j/ 2 , ^a) the coordinates of electron (2) relative to nucleus b, the 
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analytical expressions for the distances are 

^ db = ^ 

r 12 = V( Xl - x 2 ) 2 + (yi - y 2 ) 2 + (*i - 2 2 - R) a 
Ta2 = ^ + 2/1 + (R + Z2) 2 

rji = V*f + yi + (zi - R) 2 


Since the atoms are assumed to be far apart, R~> x 1 , R x 2 , etc. 
Since 

vrr* 2 + 8 e + 

we obtain, upon expanding the expressions in 184 and retaining only 
the first two powers of the coordinates of the electrons, the results 

J_ _ 1 

Tab R 
1 1 


r 12 R 
[- 


X 

(*1 - s 2 ) 2 + (2/1 - y 2 ) 2 - 2 (zi - z 2 ) 2 - 2 R(zi - 22) 

2 R 2 


] 


18-5 


J_ _ 1 

" xl + yl — 2 z 2 + 2 Rz 2 

T a2 R 

2 R 2 

1 _ 1 

x\ + y\ — 2 z\ - 2 Rzi 

r 61 R 

L 2 R 2 


Combining these terms gives, for the perturbation H', to this approxi- 
mation, the result 


H' = ^3 {x!X 2 + yxy 2 - 2z x z 2 \ 


18-6 


which is the “ dipole-dipole ” interaction. 

For ^o(l) and ^( 2 ) we take the Is eigenfunctions of hydrogen. 
Using the perturbation above, we immediately see that the first-order 
perturbation energy is zero, since H' is an odd function of the coordinates 
while the ^’s are even functions of the coordinates. According to 
equation 7*27, the second-order perturbation energy is 


, hM q 

0 £0 E 0 - E k 


18-7 


where the summation is over all the states k of the hydrogen atom 
except the ground state. The denominator of this summation is equal to 
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--M) 

a 0 \ n 2 / 


(since there are two hydrogen atoms), where n is the 


principal quantum number of the state fc. The denominator of the 

3 e 2 e 2 

summation thus varies from — 7 — to ; in order to evaluate the 

4 CLq Oq 

e 2 

sum, we set the denominator equal to for all terms. This gives an 

approximate value for the second-order perturbation energy equal to 


EH - 


1 


e 2 /a 0 ic7£ 0 

1 


E H&kEL 0 


e 2 /a 0 % 


E m'okHlo) - (H'ooH'oo))} 


18*8 


1 


e 2 /a 0 


( H' 2 ) 


00 


since Hq o = 0 and L(ffoJfc#*o) = (H' 2 )oo- In evaluating (H' 2 ) 00 , the 

* 

cross terms are zero for the same reason that H' 00 vanished ; we therefore 
have the result 


(H ' 2 ) oo = ~ ftii l)ti,V)[AA + vU + 4^„(1)^(2) dr, dr 2 


= ^0 [afci + M + 4z?zi] 

18-9 

Since the Is state is spherically symmetrical, 


A = y\ = A =* iA, etc. 

1810 

so that 


(#' 2 )oo = | 

18-11 


According to equation 6-33, r 2 = 3a 2 . The second-order perturbation 
energy is thus 

1812 

By an actual summation of the series 18-7, Eisenschitz and London 1 

2 5 

obtained the more accurate value E” — —6.47 ; an exact pertur- 

R 

1 R. Eisenschitz and F. London, Z. Phytik, 60, 491 (1930). 
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bation calculation 2 gives —6.499 as the value of the numerical coefficient. 
If the expansion of H 7 is extended to include higher powers of the 

coordinates, the van der Waals energy contains not only the term in i 

R 


but also terms in Jg » ^ , etc. By a procedure similar to that carried 

out above, Margenau 3 obtained for the van der Waals energy the 
expression 

6e 2 ao 135 e 2 a 7 0 1416 e 2 ag 


V" _ 

Cjq — — 


R 6 


R* 


R 


10 


18-13 


The van der Waals energy may be expressed in terms of the polari- 
zability by an argument due to London. 4 Instead of writing 

E'o = 2 ~ (H 12 ) oo, we write E" = — A (H' 2 ) 00 , where I may be 

6 l (X o ZL 

taken to be the first ionization potential with the same degree of accu- 
racy as we formerly took it equal to the ionization potential. According 
to 176 the polarizability of an atom in a stationary field (v = 0) is 

o p 2 i 

or = — Z — (a|r|6)-(6|r|a) 
oh b v db 

Ar> "^proximate value for the polarizability is therefore 

2 z> 2 1 

— E(a|r|&).(ft|r|a) 


18-14 


a = 


3 h Va,b b 

2e 2 


2e 2 - 0 


Zhv a b 


(ojr 2 !®) = — r" 


18-15 


where P is an average value of v a i>, and hv a b has been replaced by I. 
3 I 2 a 2 

(H' 2 ) oo is thus equal to - — =- ; to this approximation, and the van der 


2 R 


Waals energy is 


EX 


3 lc?_ 

4 R 6 


18-16 


London has shown that, if the atoms are not equivalent, the analogous 
expression is 


pn _ _ 3 IaI_b OACtjB 
0 " 2I A +I B R 6 


18-17 


2 L. Pauling and J, Beach, Phys. Rev., 47, 686 (1936). 
8 H. Margenau, Phys. Rev., 38, 747 (1931). 

4 tf. London, Z. Physik, 63, 246 (1930). 
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where I a and a A are the ionization constant and polarizability of atom- 
(or molecule) A and I b and as are similar quantities for atom B. 

For many-electron atoms, the perturbation energy is a sum of terms 
of the type 18-6, with one term for each pair of electrons. Further 
details on the van der Waals forces in complex atoms may be found in 
the review article by Margenau. 6 

18b. The Quantum-Mechanical Virial Theorem. 6 In classical 
mechanics, the virial theorem states that 

XX = -XXX, 18-18 


where the bars denote time averages and F Xi is the x component of the 
force on the ith particle; the left side of this equation is 2f, where T is 
the kinetic energy of the system. This theorem also holds in quantum 
mechanics, as may be seen from the following argument. Schrodinger’s 
equation for a system of particles is 

dfy 


ft 2 #4 

8 5r 2 m,- dx 2 


+ (7 - E)i = 0 


18-19 


Operating on equation 18-19 by x$* — , we obtain 

OXj 


i 8 x 2 m,- 


xtf* 


d 3 \l/ dV d\L 

+ x/-^ + x/(F-£)/ 
dxj dXj 


dxi dXj 


0 18-20 


while from 18-19 we have 


d\l/ 

xjTiy-E) = e 


h 2 dt d 2 t* 
Xi 


18-21 


dXj i %7t 2 7Yli 3 dXj dx 2 
Substituting 18-21 in 18-20, and summing over j, gives 
V- h2 v- (,* ** an. n 1DOO 

? 8ir 2 m£ X, \ dx 2 dXj dx 3 - dx 2 ) + ^ dxj * 

It is possible to integrate the first term in equation 18-22. We have 


1*2 


XX 

d 7 3 dx 
dxi 4>* 


j \ dxi dxj 1 dxi dx, 3 dxj dxj 

18-23 

dXi . j \ 3 dXidx j 3 dxj dXi/ 


6 H. Margenau, Rev . Modern Phya ., 11, 1 (1939). 
• J. Slater, /. Chem. Phya 1, 687 (1933). 



356 


SPECIAL TOPICS 


Therefore 


dXi 



3 ° x 3 


dXi 


+ 2>y (t 


ay 

dx^ dXj 


df ay*\ 

dx 2 / 


18-24 


or 



ty d 2 t*\ 
dXj dxj ) 


d*\b 

- wt 5+ 


dxi 


_d_ 

dXi 


i* 3 i J j?> 

dxi yp* 


18-25 


When 18-25 is integrated with respect to x,-, the last term vanishes 
because the term ^* 2 is zero at ± » . Integrating 18-22, we thus obtain 


2E~ 


h 2 

Sir 2 mi 




18-26 


dV 

Since F x . = — - — , this may also be written as 


2 J>(? B* dr ' S r <?*''’■*** 18 ' 27 


or 


2f = - Lx,* 1 * 

/ 


18-28 


where the bars now represent quantum-mechanical averages. 

If the potential energy arises entirely from the interactions of charged 
particles, the term Y^XjF Xj takes a particularly simple form. The 

3 

potential V for such a system may be written as V = ]£ V mn with 

mj*n 


v mn = 


, where « m and e„ are the charges on particles m and n a dis- 

Tmn 


tance r mn apart. We will now calculate the quantity Exj • 
for this pair of particles. Since 


dV m 


dx. 


r mn « V (x m - X„) 2 + (y m - 2/n) 2 + (Zm - 2n) 2 


^mw __ (#m ^n) ^ ^mn ___ %n) * ^ 

^ r wn dx n r mn 


we have 
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We thus obtain the result, since 


dV 


dfmn , 

— » etc. 

dr mn dx m 




av m 


dXj 


- "X 2 { (x m ~ X n ) 2 + (y m - y n ) 2 + (z m - 2„)*| 


€ m € n 


= -V* 


or, summing over all pairs 

v dV v 

• - r- " v 

i dXj 

For this particular case, equation 18-28 becomes 

2 T = -V 


18-29 


18-30 

18-31 


for a system of charged particles. 

For & diatomic molecule, the virial theorem can be put into the 
following useful form. If the potential energy of the nuclei (the elec- 
tronic energy) is E(r), the external force required to hold the nuclei 
d E(r) 

fixed at a distance r is — - — . If we apply this force to the nuclei, the 

dr 

virial theorem 2T = — Y.XjF Xj becomes 


2T' = -V' -r 


dE(r) 

dr 


18-32 


where T' is the kinetic energy of the electrons only and V' is the poten- 
tial of the electrostatic forces. Since the nuclei have no kinetic energy 
in this case, the energy E is equal to T' + V'. Using this relation, we 
may solve 18-32 for T' and V separately, obtaining 


T' - -E - r 
V’ - 2E + r 


dE 

dr 

dE 

dr 


18-33 


The potential energy curve for a diatomic molecule can be repre- 
sented with reasonable accuracy by the Morse curve 


or 


E(r) = D{1 - e — 

E(r) = D { — 2e -o(r-r,) + e ~ 2a(r ^} 


18-34 
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where the energy zero has been chosen so that E(r) =0 when r = ». 
For purposes of illustration, let us take D = 100 kcal., a = r, => 1. 
The resulting _curves_ for E, T', V' are plotted in Figure 18-2. At 
r = oo, E = T' = V' = 0. As the intemuclear distance is decreased, 
the kinetic energy first decreases, then increases rapidly as the nuclei are 
brought closer together. This increase in kinetic energy is more than 



compensated for by the decrease in potential energy, leading to the for- 
mation of a stable system. It is of particular interest that at the equilib- 
rium position 2 T' = —V'; that is, the kinetic energy of the electrons is 
— j the electrostatic potential energy, so that the binding energy E is 
equal to \V’ = — T'. 

18c. The Restricted Rotator. Of considerable importance for the 
study of the internal motions of complex molecules as well as for the 
study of the motion of molecules in crystals is the problem of the energy 
levels of a rotator moving under the influence of a potential field. As 
the simplest example, we consider a rigid rotator whose moment of 
inertia is 7, moving in a sinusoidal potential field of the form 

V(<p) = y (1 - OOS TUp) 


18-35 
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where ip is the angle of rotation. The wave equation for this system is 


d 2 t(<p) 8ir*I 

dv 2 + h? 

Making the substitutions, 


E r 


Vo 


(1 — cos rup) \ — 0 


18-36 


x = 


O r = 


TUp 

2 

8tt 2 /Fo 
n 2 h 2 
32V 2 

n 2 h 2 


18-37 




this equation reduces to Mathieu’s differential equation 
d 2 t(x) 


dx 2 


+ (a r + 2d cos 2x)ij/(x) = 0 


18-38 


the solutions of which are far from simple. We may, however, consider 
the limiting cases, where the motion resembles that of a free rotator or 
that of a harmonic oscillator. For a r 28 the equation becomes 

dV 

dx 2 


2 + a r \f/ - 0 


18-39 


±,r T 


which has the solutions 

i = e ±irx = 2 18-40 

where r is a positive integer, including zero. From the symmetry of the 
potential field, we must have 4'i‘p) = t (v + — ^ ■ This requires that 
r be restricted to even integers. Since a r = r 2 the energy levels are 
Vo nW 

2 


E r 


32x 2 / 

For the various values of n, we have 

h 2 


r = 0, 2, 4, 6 


18-41 


n = 1 

n = 2 

n = 3 


E IS. 
m ~ 2 


8ir 2 I 


m 2 , m = 0, 1, 2, 3, 4 


V h 2 

Em - ~ = ^ 2 jm 2 , m = 0, 2, 4, 6 


E„ 


Vo 

2 


h 2 

8ir 2 I 


m 2 , m = 0, 3, 6, 9 ■ 
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For n > 1, only - th of the energy levels for the free rotator are allowed. 
n 

The connection between this fact and the origin of the symmetry number 
(section 15e) in rotational partition functions is obvious. These energy 
levels are doubly degenerate. 

For the opposite case, a r <C 26, the wave function will have appreciable 
values only for 0, so that we may write 

0 + [a r + 20(1 - 2s 2 )] > = 0 18-42 

which is the wave equation for the harmonic oscillator. The energy 
levels are given by the relation o, = 2(2r -f 1 )V0 — 20, so that 

E r = (r + i)hv 0 , r = 0, 1, 2, 3 • • - 18-43 

^ r— 

where v 0 = — ^ . For the intermediate case a r ~ 0 the energy 

levels will not be well approximated by either of the above equations. 
However, the Mathieu equation has been solved exactly for certain 
values of 6; the eigenvalues a r are tabulated by Wilson 7 for values of 6 
from 0 to 40. We reproduce in Table 18-1 the exact eigenvalues a r for 
the first seven levels for 6 = 4, 9, 16, and 36, together with the values 
calculated by the above expressions for the limiting cases (in parenthe- 
ses). Values above the line were calculated by the harmonic oscillator 
approximation; those below the line were calculated by the free rotator 
approximation. 

TABLE 18*1 

Eigenvalues a r op the Mathieu Function 


For the intermediate case a r ~6 the energy 


e 

- 4 


0 ■■ 

= 9 


0 = 

16 

-4.2805 

(• 

-4.000) 

-12.262 

(- 

12.000) 

-24.259 ( 

-24.000) 

2.7469 

( 

4.000) 

- 1.3588 

( 

0.000) 

- 9.3341 ( 

- 8.000) 

6.8291 

( 

4.000) 

7.9828 

( 

4.000) 

4.3712 ( 

4.000) 

16.452 

( 

16.000) 

17.303 

c 

16.000) 

16.819 

; 16 . 000 ) 

16.150 

( 

16.000) 

20.161 

( 

16.000) 

26.009 ( 16.000) 

36.229 

( 

36.000) 

37.157 

( 

36.000) 

39.315 ( 36.000) 

36.230 

( 

36.000) 

37.21 

( 

36.000) 

40.22 i 

[ 36.000) 


0 

= 36 

-60.256 

(-60.000) 

-37.303 

(-36.000) 

-15.467 

(-12.000) 

5.1467 

( 12.000) 

24.379 

( 16.000) 

42.118 

( 36.000) 


7 E. B. Wilson, Chem. Revs 27, 31 (1940). 
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I. PHYSICAL CONSTANTS 


These values of the important physical constants are taken from tl^e 
tabulation of R. Birge, Rev . Modern Phys ., 13 , 233 (1941). Many of the 
numerical values used in the text are based on slightly different values 
for certain of the physical constants ; these differences are not significant. 


Velocity of light c = 2.99776 X 10 10 cm. • sec.”" 1 

Charge on electron e = 4.8025 X 10~ 10 abs. e.s.u. 


Ratio, charge to mass of electron 

Planck's constant 

Ratio 

Mass of electron 
Mass of proton 
Mass of hydrogen atom 

Ratio 


— = 1.7592 X 10 7 abs. e.m.u. * gm.“ l 
m 

h - 6.6242 X 10™ 27 erg • sec. 


- = 1.3793 X 10 17 erg • sec. • e.s.u.’ 
e 

m = 9.107 X 10~ S8 gm. 

M p = 1.6725 X 10~ 24 gm. 

M b = 1.6734 X 10~ 24 gm. 


— = 1836.5. 
m 


Boltzmann constant 
Gas constant 


Avogadro's number 
Rydberg constant for H 
Rydberg constant for <x> mass 


Bohr radius 


(—) 

\4v 2 me 2 / 


Bohr magneton ( 


eh \ 
\&irmc/ 


k = 1.3805 X 10“ 16 erg • degree™ 1 . 

R = 8.3144 X 10 7 erg • deg.™ 1 ♦ mol.™ 1 
= 1.9865 cal. • deg.™ 1 • mol.™ 1 
N « 6.0228 X 10 28 mol.™ 1 
R n = 109,677.581 cm.™ 1 
Roo = 109,737.303 cm” 1 

ao = 0.5292 X 10” 8 cm. 


f$ = 0.9273 X 10™ 20 erg • gauss™ 1 


1 electron volt = 1.6020 X 10“ 12 erg • molecule™ 1 = 23,052 cal. • mol.™ 1 
- 12395 X 10~ 8 cm. - 8067.5 cm.™ 1 


e 2 

— = 2Roo * 27.205 e.v. 
ao 


H. VECTOR NOTATION 

The concept of a vector as- a quantity having both magnitude and 
direction is quite familiar, as is the parallelogram rule for vector addition. 
If A is any vector, and A x , A 2 , and A 3 are the projections of this vector 
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along the x , y , and 2 axes, respectively, A may be written as the vector 
sum 

A = A x +A 2 +A s III 

It is convenient to define a set of unit vectors i, j, k, which are vectors of 
unit length directed along the x, y, and z axes, respectively, of a rectangu- 
lar coordinate system. If we now let A x be the magnitude of Ai, we 
have Ai = iA X) etc., so that, in terms of the unit vectors and the com- 
ponents A Xf A y , A g we have 

A = iA x + j Ay + k A 9 112 

There are two types of products of vectors, the scalar product and the 
vector product. If \A\ and \B\ are the absolute magnitudes of the two 
vectors A and B, and 0 is the angle between them, then the scalar prod- 
uct A • B (“ A dot B ”) is defined as 

A • B = |a||b| cos 0 II-3 

In terms of the components of A and B, the scalar product is 

A • B = A x B x + A y By + A Z B Z 11*4 

As an example of the use of the scalar product, consider the motion of a 
particle through a small distance ds, subjected to a force F which makes 
an angle 0 with the direction of motion. The work done by the force is 
the product of the component of the force along the direction of motion 
times the distance through which the particle moves, or, in vector nota- 
tion, dW = F • ds. 

The vector product A*B (“ A cross B ”) is a vector C perpendicular 
to the plane of A and B. If A is rotated into B through an angle less 
than 180°, the direction of C is that in which a right-handed screw would 
move if given a similar rotation. The magnitude of C is defined to be 
|C| = |A||B| sin 0, where 0 is the angle between A and B. In terms of 
the components of A and B the vector product is 

A*B = i (AyB z - AgBy) + j (AgB x - A X B Z ) + k (A x B y - A V B X ) 
i J k 

= A x Ay Ag IL5 

B X By Bg 

From the definition of the direction of C = A*B, it is obvious that 
(A*B) = — (B*A). Angular momenta can be quite simply expressed in 
terms of the vector product. If r is the vector distance from a fixed 
point to a particle of mass m moving with the velocity v, it can readily be 
seen from the definition of the vector product that the angular momen- 
tum of the particle relative to the fixed point is r*rav = m(r*v)* 
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Many physical laws assume a particularly simple form when written 
in terms of the vector differential operators. We define a vector opera- 
tor V (“ del ”), which, in rectangular coordinates, is 


V = 


.± .± 
dx ^ dy 



11*6 


Operating on a scalar function <p, this operator generates a vector called 
the “ gradient ” of <p: 


V,-i^ + i^ + k?2 

dx dy dz 


II-7 


The scalar product of V and a vector A gives a scalar function known as 
the “ divergence ” of A: 


V • A = + d Al + I 

dx dy dz 


II -8 


The vector product of V and a vector A gives a new vector called the 
“curl” of A: 


V=A = 


i j k 

a a a 

dx dy dz 


■Ax A v Ag 


11*9 


Of particular interest in quantum mechanics is the operator formed by 
taking the scalar product of V with itself: 


, d 2 d 2 d 2 


11-10 


The operator V 2 (“ del squared ”) is usually called the Laplacian 
operator. 


HI. THE OPERATOR V 2 IN GENERALIZED COORDINATES 

In rectangular coordinates, the operator V 2 is given by the relation 

d 2 

+ t-s • Because of the frequent occurrence of this 
dz 

operator in quantum mechanics, we will show by means of a physical 
interpretation how this operator can be transformed to coordinate 
systems which are not rectangular. The only coordinate systems that 
we shall consider are those in which the three coordinate surfaces cut 
each other at right angles. Such coordinates are known as orthogonal 
coordinates.. Let these othogonal coordinates be q u q2 f (fe. The dis- 


d 2 d 2 

dx? ^dy 2 
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tance d$x perpendicular to the surface = constant, between the two 
points (gi, q2, qz) and (qi + dqi , q2 , qz), will not in general be equal to 
dqi but may be written as dsi = hidqi, where hi may be a function of 
the coordinates. The distance between the points (qi, <72, £3) and 
(qi + dqi , q2 + dq 2 , qz + dq s ) will therefore, since the coordinate sur- 
faces are perpendicular, be 

(ds ) 2 = (dsi ) 2 + (d«2 ) 2 + ( dsz ) 2 

= h\(dqi ) 2 + h 2 2 (dq 2 ) 2 + h\(dq |) III-l 


Comparing this with the corresponding expression in rectangular 
coordinates 


we find 


(ds ) 2 = (dx ) 2 + (dy ) 2 + (dz ) 2 

-C 


d x ax ax \ 2 

dqi + — dq 2 + — dq 3 J 


Kdqi 


a ? 2 


a ? 3 


, / , , ay \ 2 

+ ( — rf ? 1 + — + ~ ) 

\a?i a?2 a ? 3 / 


dz dz \ 2 

d?i + — ) 


+ 


/az 

W 


ag 2 


a ?3 




hl ~ (*,,/ 
hl ■ (S) + © + (£. 
‘*-( 0 + ( 0 + (0 




y 


III -2 


m-3 


the other coefficients being necessarily equal to zero. 

Now suppose that the coordinate space is completely filled with a fluid 
whose density at the point (qi, q 2 , q 3 ) is p(qi, q 2 , qz)- Suppose further 
that the motion of the fluid at any point is determined by a velocity 
potential V (qi, q 2 , q 3 ) such that the velocity in any direction is given by 
, dV 

the value — — , where ds is a displacement in the given direction. Let 

as 

us calculate the rate of accumulation of fluid in a small element of volume 


bounded by the surfaces qi = q°, ?i = <?? + dqi ; q 2 - q 2 , q 2 = q 2 + dq 2 ; 
9 s = qi, 9 a = 93 + dq 3 (Figure III-l). Since the sides of the volume 
element are infinitesimal, we may assume that V is constant over any 
one surface. Now consider the surface gx = q°. The rate of flow 
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perpendicular to this surface is 

dV 1 dV TTTmA 

asj h\ dq% 

and the area of the surface is h 2 h$ dq 2 dqz. Hence the amount of fluid 
flowing through this side in unit time is 


/ dV\ h 2 h 3 dV 

P \ d^/ h2hz dq2 dqs = ~~ p X dqi dq2 dq * 

The rate at which fluid is flowing in through the other side is 


h 2 hz dV 


h 2 hz dV 


p ~hTd^ dq * dq 3 + to r arf dq 2 d93 dqi 


111*6 


111*6 



Fig. IIM. 


dqi dq 2 dq 3 


so that the rate of accumulation of fluid due to this pair of surfaces is 
d | h 2 h 3 dV 
dqi l hi dqi 

The other two pairs of surfaces contribute 
d j hih 3 dV 


111*7 


dq 2 1 h 2 dq 2 


dqi dq 2 dq 3 


and 


-L{ hti 

dq 3 l h 3 dq 3 j 


dqi dq 2 dq 3 
so that the total rate of accumulation is 


111*8 


111*9 


r d i h 2 h 3 dv i 

l + jL| 

\ h\h 2 dV 

Ldgi r hi dqi, 
* 

dq 2 

[ P h 2 dq 2 


+A 

dq 3 


hihdVH 

h 3 953/ J 


dqi dq 2 dq 3 111*10 
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The rate of accumulation of fluid per unit volume at any point is just the 
rate of change of density at that point. Hence, since the element of 
volume is 


d,T = dsi ds 2 dsz = hih 2 hz dqi dq 2 dq g 
the rate of change of density is 
dp _ 1 (” / h 2 h 2 dF\ / feifeg dF \ 

dt h\h 2 h 2 |_3?i Y hi dqj dq 2 \ h 2 dq 2 ) 

+ ±( p hh*IX] 

dqa V h% dqz/ J 


mu 


IH-12 


If we carry out the same analysis in rectangular coordinates, we find 


dp d/ dV\ d / dV\ d / dV\ 
dt dx\ dx) dy\ dy) dz \ dz ) 


= V - (pVF) 


III-13 


dp 

Since the value of — is independent of the particular coordinate system 
dt 

we use, we conclude that, in general, 

? . (p VV ) = -j- r a L ^ ) + — f P ^ 

hih^hg Ld^i \ h\ dqi/ dq2\ h2 dq2/ 


+ 


d / h\h2 
dqaX h a dqz) _ 


III-14 


Taking now the special case p = constant, we find the expression for the 
operator V • V == V 2 to be 


V 2 = 


i r* 


hih&hft L^9i 


/h^hz d \ d /hihz d \ 
\ hi dqi) dq 2 \ h 2 dq 2 ) 


+ 


dqz 


/ hihy d \ ~l 
\ h 3 dqz)\ 


111*15 


In particular, if we use rectangular coordinates, equation III- 15 becomes 

a2 

III-16 


v a* 2 + dy* + dz 2 


In rectangular coordinates, the operator for the kinetic energy of a 
particle is 

h 2 / ^2 $2 ^2 
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We thus conclude that, in generalized coordinates, the operator for the. 
kinetic energy of a particle is 



111*18 


where V 2 is given by equation 111*15. Equation III-18 may also be 
verified by a direct transformation of 111*17. 

We give below a few of the most important coordinate systems, 
including the equations for the transformation from rectangular coordi- 
nates, the explicit expression for V 2 , and the value of the volume element 

dr — dsi d$2 ds$ = hih^h^ dq\ dq 2 


1. Polar Coordinates. 
x = r sin 6 cos <p 



dr = r 2 sin 6 dr dd dtp 


y = r smQ sm ip z — r cos 6 


1 d_( . d\ , 1 

r 2 sin 9 dd \ m 60/ r 2 sin 2 6 6<p 2 


2. Cylindrical Coordinates. 

x — r cos 9 y = r sin 0 z — z 

2 1 6 ( d\ , 1 d 2 , a 2 

V ~~r7r\ T *) + ?^ + t? 

dr - r dr dd dz 



Fio. III-2, 


3. Elliptical Coordinates (Figure III-2). (<p measured from xz 

plane.) 

* = oV (p 2 — 1)(1 — V *) cos <p p — — ~!~ Tb 

R 


y - oV( M 2 - 1)(1 - v 2 ) sin <p v = - 
z ” apv ( R 2a) 
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V s - 


W - *) 




2 2 
fJL — J/ 


jB 3 


(m 2 - 1)(1 - J) <v 8 


] 


dr — — (jx 2 — v 2 ) d/z efy dtp 
8 


IV. DETERMINANTS AND THE SOLUTION OF 
SIMULTANEOUS LINEAR EQUATIONS 

A determinant, it will be recalled, is defined by the equation 


) V Pp ( a ll®22«33 • • * Ann) IV* 1 


where P„ is the operator which permutes the second subscripts and 
v is the number of interchanges of pairs of subscripts involved in P 
From the definition, it is readily seen that the determinant vanishes if 
two rows or two columns are identical, since interchange of two identical 
rows or columns changes the sign of the determinant but leaves its value 
unchanged. If we denote the above determinant by A, we can define a 
set of numbers A{ m by the relation 


dll 

012 

013 * * * 

0m 

®21 

022 

CO 

<3 

02n 

031 

032 

033 * * * 

© 

CO 

3 

0nl 

0n2 

0n3 * * * 

Onn 


A — 53 ^im,A % 

*-l 


IV-2 


From this definition, it is obvious that 53 = 0 for k m, since, 


*~i 


if da = ai m for all i, the determinant is identically zero, and, by IV-2, 

n 

13 O'ikAim is the determinant which has a ik = a im for all i. 

<-i 

If we have a set of simultaneous equations 


011^1 + d\2%2 + 013^3 + 
021^1 + 022^2 + 023^3 + 


+ 01 n%n — 

+ 02 n%n = C2 


IV-3 


0»1#1 + &n 2^2 4" 0n3#3 + ' * * + CL nn X n * C n 
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we can obtain a solution in the following manner. We multiply the first 
equation by An, the second by A 2 1, etc., and add, obtaining 

n n n 

5-, Q>ilAn%\ + 53 &i2A.i\X2 H" * * * *4“ 53 ttinA i\%n == 51 C%Ai\ IV*4 

»-l t-1 i-1 % 


n n 

But 5] - A and 53 = 0 for m ^ 1, so that 

*«i »«i 

iLCiAa 

Ax x = 53c t A,i or = - 1 — — IV*5 

i A 


The quantity 53 a-A,i is just the determinant in which a*i has been 

% 

replaced by c*. The values of the other x’s are found in exactly the same 
manner. The general result is 

EdAin 

x n - — IV-6 

A 

A solution exists only if A 5* 0, or, if A = 0, a solution exists only if all 
the c’s are zero. Conversely, we may say that, if all the c y s are zero, 
then a solution (aside from the trivial solution x\ = X2 = • • • = x n = 0) 
exists only if A = 0. This last statement is the one of particular impor- 
tance in quantum mechanics. 


V. THE EXPANSION OF — 

r»; 

In terms of the distances r,- and rj and the angle y between the vectors 
from the origin to the two particles, the distance r, ; - between the two 
particles is 

r,y = Vrj + rf — 2 r,ry cos y V-l 


If we let r> be the greater of r,- and r,, and r< be the lesser, then 

r,-y = r> V 1 + z 2 — 2* cos y V-2 


where* = 


l< 

r>' 


Let vis now look for an expansion of — of the type 

r a 


r ij 


Jl 1 

r > Vl + x 2 — 2x cos 7 


— T.a„P n (cosy) 
r > » 


V-3 


If we square both sides of this expansion, multiply by sin y dy, and inte- 
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grate over 7 from 7 = 0 to 7 = ir, we obtain, by 4 - 53 , the result 

1 1 (1 H~ jO _ -r— 2 2 

x ° g (1 - x) ~ ? 2n + 1 

But, by an expansion, we have 


U (1+*) = y 2 

x 8 (1 - x) 7 2n + 1* 


,2n 


V -4 


V-5 


so that we conclude that a„ = x", and the expansion of — in terms of 

Ui 

P* (cos 7) is 

— = — Lx n P n (cos 7) V-6 

Uj r > n 


We must now express P n (cos 7) as a function of the 0’s and of the two 
particles. Considering this as a function of 0* and w, we may expand it 
in terms of the orthogonal functions P| wI (cos 0*-) e xmif>i . P n (cos 7) is a 
solution of the equation 


sin 0, 


d_ 

M. 


fa**) 


+ 


1 a 2 P n 

sin 2 di d<Pi 


+ n(n + 1 )P n = 0 V *7 


since this equation remains unchanged under any rotation. The general 
solution of this equation is a linear combination of the functions 
P|, m| (cos 6{) e tmVi , so that we may express P n (cos 7) as 

m=»4-n 

Pn (cos 7) = L A nm p { ™ ] (cos 9{) e imVi V S 

n 


where, by 4 - 53 , 

^ - 2j fa trwi f K (ros Y) pr ' (C08 #i) e ~‘”‘ * v ' 9 

We also can expand Pjj” 1 (cos di) in terms of the functions 

P]f' (cos 7) e lkv as 

Pl m| (cos di) = ZBnkP'n' (cos 7) e ikv V-10 


where 


B nk = jtTW- f P " m ' (C ° S 6i) e ~ imnP " (cos y) e ~ ikV dT V41 

Equation V* 10 must hold for 7 = 0 , that is, for di = dj,<pi = <pj. Then 

Pl m| (cos di) = * 2 BnfcPi? 1 (1 ) = B n0 P°n( 1 ) V-12 
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since P*(l) = 0 for k ^ 0. P°(l) = 1; from V*12 and V-ll we have 
J Plr 1 (cos 9i) e~ imv <P„ (cos 7 ) dr = Pj, m| (cos 6j) e~ imr < V-13 


so that 




V-14 


1 


Combining V-6, V-8, and V-14, we have the final result 

= £ m^pw (cm*, )«*-<«-*> v.w 

7 i; «»0m*-n i |W])l r> 

Equation V-15 may alternatively be expressed as 

- = £ m £ + ” r-xi -§I ©L m| (®i)©L m| (*,•) V-16 

zn. i r> 


r H 


n=0 m 

1 00 ffl “+ n 4^ rj- 

-=E E 7 p&UTfo, «)!?*(«**) 

n—o m=— i»2w T 1 r> 


Y-17 


VL PROOF OF THE ORTHOGONALITY RELATIONS 

The most important single theorem in group theory is that giving the 
orthogonality relation between the irreducible matrix representations 
of any group. As stated in Chapter 10, this theorem is 

5£r» (^)mn Tj(R) m r n ' = hijb m m^ nn f VLl 

where U and lj are the dimensions of the representations and h is the 
order of the group. We wish to give a simple proof of this theorem, 
following essentially the method of Speiser. 1 Before we can begin the 
actual proof of the orthogonality relations, we need several preliminary 
theorems. 

Theorem 1. If we have two sets of variables %[ • • • x' n and y[ • • • y' m , 
then every bilinear form 

/= E 'Lc ij x' i y' j VI-2 

»-l i«i 

of these variables can be reduced to the normal form 

• / = ExkVk VI-3 

1 

1 A. Speiser, Theorie der Guppen von endlicher Ordnung , Springer, Berlin, 1927. 
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where r < n, r < m, by a suitable linear transformation of the variables 
x'i and y'j. 

Proof. The product 

— (Lc,iX.O(Zci,l/;) VI -4 

Cll *- 1 ;~1 

contain all the terms in / which involve either x[ or y[. If we make the 
substitutions 

1 n 

*1 = —7= £ CilXi 
veil'- 1 

1 m 

2/i = / — X) CijVj 

V Cn i“i 

we may write / as 

n m 

/ = xiyi + EE 

»*=2 y -2 


VI-5 


VI-6 


We can, without loss of generality, assume that n < m. After (n — 1) 
substitutions of the type VI-5, we will have obtained the result 


/ = XiVi + X 2 S/2 H 1" + 


tn 


y-n 


We now make the final substitutions 


VI-7 


m 

X„ = x' y n = £ Vi = O' > n) 

y«n 

Equation VI-7 then reduces to 

f^ZxiVi VI-8 

*«i 

which is the desired result. As the determinants of the transformations 
on the a/’s and y h s are different from zero, the transformations have 
reciprocals. In certain special cases the normal form VI-8 may contain 
less than n terms. 

Let us now take a set of variables x[ • • • x f n which form a basis for an 
irreducible representation IV of a group. We also take a set of vari- 
ables y[* • • y' m which form a basis for an irreducible representation r y / 
of the same group. We have then 
Theorem 2. If IV and Y yf are two irreducible representations of a 
group there is no bilinear form of the variables x[ and y'j which is always 
invariant when both the x\ and the y'j are subjected to some operation R 
of the group unless IV is identical with IV . 
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Proof . We shall prove this theorem only for the type of groups in 
which we have been interested, namely, those representing transforma- 
tions of coordinates. The corresponding matrix representations involve 
only unitary matrices; for simplicity, we will assume that the matrices 
are real. According to Theorem 1, any bilinear form 

f-iZcM VI-9 

can be reduced to the form 

/- 5>Jb Vk VI-10 

km 1 


by a suitable transformation. We consider that the matrices of the 
representations IV and T v r have been subjected to the same transfor- 
mation, so that we have obtained the corresponding new representations 
T x and r„ which have the x’b and y’ s as their bases. We now require 
that / be invariant when both the x’s and the y’a have been operated on 
by some operation R of the group. 


Rx k = ET x (R)skX* 

•-i 

Ryk = Er v(R)tkyt 

t~i 


VI-11 


If we operate on the x’b only in equation VI- 10, we have 
/ = 2/i£ Tx(R)*\x 9 + y2lL?x{R)*2x B H h y r £ v^R)^ VI-12 

a-1 «-l ami 


Arranging this according to the x’b we have 

/^il r x (R) lk y k + • • • + x n E T x (R) nk y k VI-13 

kml Jfc-1 


When we operate on the y’ s, equation VI- 13 must reduce to equa- 
tion VI- 10. This requires that 

£ r x(R)ikRyk = y% i = l - • • r 

kml 


which is equivalent to the requirement that 


RET x {R) ik y k = yi 

km 1 


or 

RT l Vi « £ r x {R)ikVk i « 1 • * • r VI- 14 
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For the real unitary matrix representations we have been considering, 
the matrix of the inverse transformation is obtained from the original 
matrix by simply interchanging rows and columns. By definition, 
therefore 

R-'yi = £ r v (R) ih y h i = 1 • • • m VMS 

k - 1 

Comparing VI- 14 and VI-15, we see that VI-10 will be invariant only if 
m = r. By interchanging the order of operations, we could prove in 
the same way that VI-10 will be invariant only if n = r. Again com- 
paring VI-15 and VI-14, we see that VI-10 is invariant only if 
T x {R)ik = T v (R) ik for i, k — 1 • • • r. Equation VI-10 is, therefore, 
invariant only if T x and V y are identical. 

We now have the necessary background for the proof of the orthogon- 
ality relations. Taking the sets of functions Xi • • • x n , y\ • • - y n > which 
are the bases for the irreducible representations T* and T y , we have 
Theorem 3. If r* is not identical with r„, then 

= o 

r 

for all values of i, j, k, l. 

Proof. Referring to the definition of the direct product given in 
Chapter 10, we see that the mn functions x,y t form a basis for a repre- 
sentation r x r„ of the group of dimension mn. If we denote these mn 
functions by z\ • • • z r (r = mn) and the corresponding representation 
by T, = r x r v , then the r 2 matrix elements of T t (R) are T x (R)ijT y (R) k i 

(i, j = 1 • • • n; k, l = 1 • • • m). If we now operate on z, by one of the 

operations R of the group, we have 

Rz. = £,T,(R)t.z, VI-16 

<-i 

Summing over all the operations R of the group, we have 

/ = EE2, = zi.T,(R) t ,z t VI-17 

R R 1-1 

Let A be any operation of the group. Then 

Af - J^ARz, = £f?z, = f 

R R 

since AR is always an operation of the group and the operation by A 
merely changes the order of the summation. Now / is a linear form of 
the z’s, and hence a bilinear form of the x’a and y’e. But we have just 
seen that there is no such form / which is invariant under an operation 

if the croiin. Since Af = f. we must, therefore, conclude that, f is 
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identically zero. This can be true only if Hr, (R) t , is identically zero . 

s 

for all values of t and s. Referring to the correlation between the ele- 
ments of r a (R) and those of T X (R) and r„(f2), we thus see that the 
theorem is true. 

Now taking the sets of functions x\ • • • x n y k • • • y r , which are bases 
for the representation T, and noting that according to Theorem 2 the 
function 

i 

/ = T.HVk 

km 1 

is invariant under operations of the group, we can prove 
Theorem 4. 

Zr(R) {j T(R) {J = h - ; ZT{R)iiT{R) mn = 0 (ij) * (mn) 

R 1 R 

where h is the order of the group and l is the dimension of the 
representation. 

i 

Proof . The function / = J^x k y k is invariant. We operate on / with 

kml 

some operation R, sum over all operations of the group, obtaining 

ZRf = £T(R) 8k r(R) tk x.y t = hf VI* 18 

R R 

The coefficient of x t y t must vanish if * ^ t, so we immediately have the 
relation 

ZT(R)'kT(R)tk = 0 8 9* t VI-19 

R 

Now J^RT l f = Z#/, since each operation is contained once and only 

R R 

once in both summations. Recalling that the inverse matrices are 
obtained by interchanging rows and columns in the original matrix, 
we have 

HRS = HIT'S = I££I T(R) k ,T(R) kt x.y t 

R R Rk-l—lt-1 

which gives us immediately the relation 

HT(R)k.T(R) it = 0 s*t VI-20 

R 

Equation VI- 18 is thus reduced to 

HRS - H E - hs 

R R k-lj-1 


VI-21 
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This requires that 

E Zr(R) jk T(R) )k = h (j -I”-*) VI-22 

R k-1 

Considering the inverse transformation, we also obtain 

E E r (fi) M r («) = A O' - i • • • J) vi-23 

A t-1 

From VI-22 and VI-23, we see that £r(I2)#r(.R),-* is independent of 

R 

both k and j. Therefore 

ZT(R) jk T(R) ik = 7 

Combining the results of Theorems 3 and 4, we have the desired relation 

Er,-(i 2 ) mn ^ r,-(R vi-24 

vn. THE SYMMETRY GROUPS AND THEIR 
CHARACTER TABLES 

Most simple molecules will have a certain degree of symmetry; that 
is, there will be certain transformations of coordinates which leave the 
atoms of the molecule in a configuration in space which is indistinguish- 
able from the former configuration. The possible transformations of 
this type will be either rotation about an axis of symmetry, reflection 
in a plane of symmetry, inversion in a center of symmetry, or various 
combinations of these transformations. If two such transformations 
are carried out successively, the configuration will be that which could 
be obtained from some other transformation. The set of transforma- 
tions which do not alter the configuration of the atoms in a molecule 
thus form a group, the group of the symmetry operations for the mole- 
cule. We include in this appendix the character tables for most of the 
symmetry groups which are likely to occur in problems of molecular 
structure . 1,2,8 

The notation used for the operators which transform a symmetrical 
molecule into itself is the following: 

E = the identity operation, which leaves each particle in its original 
position. 

1 J. Rosenthal and G. Murphy, Rev. Modem Phyt., 8, 317 (1036). 

1 H. Sponer and E. Teller, Rev. Modem Phyt., 18, 75 (1941). 

* R. Mulliken, Phyt. Rev., 48, 270 (1933). 
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C n = rotation about an axis of symmetry by an angle — • In prob- 

n 

lems of molecular structure, the values of n \yhich will be of particular 
interest are n = 1 (no symmetry axis), 2, 3, 4, 5, 6, and w . (n = 7, 8, 
etc., are possible values but probably occur only occasionally; these 
groups are not included in the tables.) 

a = reflection in a plane of symmetry. The symmetry planes are 
further classified as follows. If the plane is perpendicular to the princi- 
pal axis of symmetry (the axis with the largest value of n), reflection in 
this plane is denoted by <j h . If the plane contains the principal axis, 
reflection in this plane is denoted by <j v . If there are axes with n — 2 
perpendicular to the principal axis, and if the plane contains the princi- 
pal axis and bisects the angle between two of these 2-fold axes, reflection 
in this plane is designated by a d . 

2t 

S n = rotation about an axis by — followed by a reflection in a plane 

n 

perpendicular to the axis of rotation. 

i = inversion in a center of symmetry. 

The symmetry groups are designated by the following notation. We 
may divide them into three main types: 

I. The Rotation Groups. These are symmetry groups which have 
one symmetry axis which is of a higher degree than any other symmetry 
axis. This axis is considered to be the z coordinate axis. The following 
groups belong to this general class: 

A. The molecule possesses an axis of symmetry only. The groups 
of interest are Ci, C 2 , C 3 , C 4 , C 6 , C 6 . The possible operations are found 
as follows. For the group C 6 , with a 6-fold axis, all powers of C$ are 
likewise symmetry operations. We thus have the operations 

C rt2 __ si __ r\ /nr4 _ ri2 r\ (5 

'-'d ~ ~ ^ 2 ) ^6 ~ ^6 

B. The molecule has the symmetry operations C» and <r v . The 
manner in which the operations are found will be discussed later. The 
possible groups are C 2v , C 3o , C 4 », C 5 „, C 6 „. 

C. The molecule has the symmetry operations C n and <r*. The 
possible groups are C U (C,),C 2A , C 3A , C ih , C 5h , C 6h . 

D. The molecule has the symmetry operation S n . The possible 
groups are S 2 (C,), S 4 , S 6 (C 3 ,). 

E. The molecule has 2 n 2-fold axes perpendicular to the principal 
n-fold axis. These axes are denoted by C', 0%, etc. The notation for 
this type of group is D n . The possible groups are D 2 (V) , D 3 , D 4 , D 5 , D 3 . 

F. The molecule has the symmetry operations D„ and <r<j. The 
possible groups of interest are D 2 < i(V d ), D 3<i . 
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G. The molecule has the symmetry operations D ny cr df and <r h . 
The possible groups are D 2 *(V*), D 3 ^, & 4h , 0 5hy D 6A . 

A number of the above groups can be expressed quite simply as the 
combination of some other group of symmetry operations plus the 
inversion i . These are 

C4 h = = C 4*1 D2 h = 

C Qh = Ce x i D 4 * = D 4 *i 

Se = C 3 x i Do* = 

D 3 d = D 3 x i 

II. Groups of Higher Symmetry. These are groups which have no 
unique axis of high symmetry but which have more than one n-fold 
axis where n > 2 . The groups of interest are 

T = the group of operations which sends a regular tetrahedron into 
itself. Tft = T*i\ 

O =* the set of operations which sends a cube or a regular octahedron 
into itself. Oh = 0*i. 

T d = the symmetry group of CH±. 

III. Groups with the Symmetry Operation Coo. 

Coo* = the group of the symmetry operations of a heteronuclear 
diatomic molecule. 

Deo h = the group of the symmetry operations of a homonuclear 
diatomic molecule. 


The symmetry operations for the rotation groups can most easily be 
found by means of the stereographic projection diagrams. For the 
group 04*, this takes the form of Figure VII* 1 . The square in the center 
represents the 4 -fold axis. Starting with point 1, where the + indicates 
that it is above the plane of the paper, we can ob- 
tain the points 3 , 5 , and 7 by applying the opera- 
tions C 4 , C4, and C4 to this point. Applying the 
operation <r v to point 1 we obtain point 2; from 
this point we obtain the points 4 , 6, and 8 by the 
rotations. These eight points are all that can 
be obtained from a single point by any combina- 
From the diagram, we see 


A 

^S— 


<Tv_ 

T' 

3 


& 






V 7 8 y 

Pig. VII I. Stereographio tion of C4 and cr.. _ 

projection diagram for the se t 0 f points also possesses the sym- 

group **■ metry elements aa, where a a represents a reflec- 

tion in a plane which bisects the angle between fhe original <r„ planes 
of symmetry. 

Diagrams of this type are given in Figure VII-2 for all the rotation 
groups. The principal n-fold axis is represented by the n-sided shaded 
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Fig. VII 2. Stereographic projection diagrams for the simple point groups. 
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figure in the center of the diagrams. S n axes are represented by open 
n-sided figures. If the symmetry operation ah is present, the large circle 
is full; otherwise the large circle is dotted. The planes <r* and 0 ^ are 
denoted by full lines. Two-fold axes perpendicular to the principal 
axis are denoted by the symbol for a 2-fold axis placed at the ends of the 
line through the center of the circle. The symbol + represents a point 
above the plane of the paper; the symbol O represents a point below 
the plane of the paper. All these points can be obtained from any one 
of them by application of the various symmetry operations. As the 
exact significance of the various symmetry groups can be better seen by 
means of examples, in Table VII* 1 we show molecules belonging to the 
various symmetry groups. 

In the character tables, the operations of the various groups are 
collected into classes. For example, the group C 3v has three classes: 
the class of E ) the class of the two 3-fold axes, and the class of the three 
planes of symmetry. This may readily be verified from the definition 
of a class and the stereographic projection diagrams. 

The characters of the irreducible representations of the groups of 
most frequent occurrence are integers. For certain of the groups of low 
symmetry, particularly the C„ groups, complex characters occur. 
When they do, the irreducible representations can be taken in pairs, the 
characters of one member being the complex conjugates of the corre- 
sponding characters of the other member of the pair. Such a pair of 
representations is essentially equivalent to a single doubly degenerate 
representation. 3 When applying the orthogonality rules to such 
representations, the rule 

Ex'Wx'W - Mi y 

R 

must be replaced by 

L[x y (2?)]V(fi) = hdij 

R 

2rt 

etc. Also, it is to be recalled that, if co = e 6 , for example, then 
1 “f~ to -(- U? Hf- CO 3 -f* CO* CO® = 0 

Non-degenerate representations are designated by the letters A and B. 
The A’& are symmetrical and the B ’ s are antisymmetrical with respect to 
rotation about the principal axis of symmetry, that is, about the z axis. 
Doubly degenerate representations are denoted by E and triply degener- 
ate representations by IF. The characters of the groups of the type 
D 6 fc = D are not given explicitly. They may readily be found in the 
manner illustrated by Table 14*7 for the group D 6 *. These represents- 
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tions are denoted by the symbols Ai„ and Ai„, etc., the g representations 
being symmetrical and the u representations being antisymmetries! with 
respect to inversion. The transformation properties of the coordinates 
x, y, z, the products of coordinates x 2 , y 2 , z 2 , xy, xz, yz, and the rotations 
R x , R v , Ri, about the x, y, z axes, respectively, are also given. If the 
operation i is present, the coordinates belong to the u representations 
while the rotations and products of coordinates belong to the g represen- 
tations, as may be seen from Table 14-7. 

TABLE VIM 

Examples of Molecules Belonging to Various Symmetry Groups* 


D w * H2, Oj, C2H2, CO2 

C«» CO, HC 1 

C 2 » H2O, SO2, H2S, CW-C2H2CI2 

C 3r NHs, PC 1 8 , CH3CI 

C 2 k iratw-CsHsCls 

Dm “ Staggered ” C 3 H 8 

D 3/ , “ Eclipsed ” C 2 H 8 

D<n C,H, 

Td CH 4 

o* SF„ 

•See also E. B. Wilson. J . Cham . Phya ., S, 432 (1934). 


The characters themselves are easily found for any group by applica- 
tion of the four rules given in Chapter X. It may perhaps be of value 
to determine the transformation properties of the coordinates and rota- 
tions in several instances. For C 2 „, for example, the various operations 
on the vector r, with components x, y, z, give us the matrix equations 



<r„ - reflection in xz plane. 
o' % = reflection in yz plane. 


Each coordinate therefore belongs to an irreducible representation, 
* to Bi, y to B s , and z to Ai. .The products of the coordinates belong to 
the representation of the direct product of the irreducible representa- 
tions involved; for example, xy belongs to B\Bi = A* The various 
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rotations may be indicated by curved arrows as indicated in Fig- 
ure VIL3. Operating on these arrows with the symmetry operations, 
we have 


/R x \ / 1 
E ( R v ) ■ ( 0 

\rJ \ o 




/R x \ /-I 
I Ry I — I 0 

\rJ \ o 

/R x \ ( 1 

a 'v ( By ) = ( 0 

\R z j \ 0 



so that R x belongs to B 2 , R v to Bi, and R z to A 2 - 


Z 



Fig. VII-3i 


When we have degenerate representations the procedure is sometimes 
less simple. For example, for D 3 a, we have the matrices 



0 0 \ 
1 ° 

0 1 / 




/ 1 
• 2 

2 

0 



0^ 





\ o o' if 




0 

-1 

0 


p 

0 

1 
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We thus see that z belongs to the non-degenerate representation A 2 , 
and that the pair of coordinates ( x , y) belongs to the degenerate repre- 
sentation E\ The product z 2 belongs to A 2 A 2 = A[, the pair ( xz , yz) 
belongs to A 2 E' = E The products x 2 , y 2 } and xy must belong to 
E'E' = E' + A[ + A 2 . If we consider any vector of unit length, then 
x 2 + y 2 + z 2 = 1. Since z 2 belongs to A' h the combination x 2 + y 2 
must belong to A [ . This leaves us xy and the combination x 2 — y 2 
to be assigned to E f + A 2 ; we conclude that the pair (x 2 — y 2 , xy) 
belongs to E\ These conclusions may be verified by working out the 
transformation matrices for the vector with components x 2 + y 2 } 
x 2 — y 2 , xy. In a similar manner, we find that R z belongs to A 2 , the 
pair of rotations ( R x , R y ) belongs to E ” . 


Cl 

E 

A 

1 


C 2 

E 

c 2 

x 2 , y 2 , z 2 , xy 

Rg, z 

A 

1 

1 

xz, yz 

y \ 

Rx 1 Ry J 

B 

1 

-1 


C s 

E 

Cl 

Cl 


x 2 + y 2 , z 2 

Rg, z 

A 

1 

1 

1 

2 Ti 

(xz, yz) \ 

(x, y) \ 

E { 

1 

CO 


(co = e 3 ) 

(x i - y 2 , xy)} 

{Rx j Ry) ) 

1 

CO 2 

CO 



C4 

E Ci Ci C\ 

X 2 A y 2 , z 2 
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E SCb 3C 2 6C 2 6C4 


Active 

Inactive 

Active 


Inactive 


(Rx, Ry, 


0 -1 -1 + 

0 -1 +1 


0/t * 0*i 


E 8C S 3C 2 6<ra 6£ 4 


A 1 

A 2 

E 

(Rx, Ry, R*) T\ 

(x, y, z) T 2 


1 1 

-1 -1 
0 0 
1 —I 
-1 1 


x 2 + y 2 , z 2 

(xz, yz) 

(x 2 - y 2 , xy) 


(*, V ) 
(R Xi Ry) 


2 cos <p 
2 cos 2 * 



•DoO h 


a: 2 + V s , * 2 


A\g 

A\u 

A 2o 


Z 

a 2u 

(a», j«) 

(Rx, Ry) 

El g 


(z, y) 

Eiv 

(** - y \ xy) 


E 2 g 

E 2u 


2 2 cos 2* 0 


-2 cos* 0 

2 cos 2* 0 

-2 cos 2* 0 , 


dx 


vm. SOME SPECIAL INTEGRALS 

f, 

a aj 
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2 n+1 


xe-^dx 


xh~ ax *dx * 


s: 

■s: 
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J r»oo 

■ 

o 

•I 

■s; 

/»oo 

12. x n i 


yjj* 1 


6 

7 

8 

9 

10 


.Zn-H^—ox* 


e~’ M da; = 


xe~ ax dx 


2 a 2 
nl 

2a n+1 


(1 -f a) 


'e-~d*=^ r Es^.W 

a n+1 


f+l j 

13. I e~* x dx = -(e° -e-°) 

*7—1 a 


/ -H 1 

aaT° x dx {e a ~ e~° ~ a(e° + e - *) } 
-l a 

r +l 

15. J ^ x n e~ ax dx = (-l) n+1 A„(-a) - A n (a) 
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Angular momentum operators, 42 
Angular momentum, total, 134 
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Associated Legendre polynomials, 59 
Atomic orbitals, 129 
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radial part, 162 
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molecular orbitals treatment, 254 
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Black-body radiation, 1 
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Bohr’s theory of hydrogen, 3 
Bond eigenfunctions, 234 
for four electrons, 238 
for six electrons, 239 
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Rumer’s theorem, 240 
Bose-Einstein statistics, 287 

Character of a representation, 182 
orthogonality relations, 183 
Character tables, 383 
Class, definition, 177 
number of classes, 182 
Commutation rules for angular mo- 
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Complex atoms, angular momenta of, 133 
closed shells, 131 
electronic states, 128 
energy levels, 132 
energy matrix, 143 
fine structure, 151 
selection rules, 159 
“ terms ” of, 141 
vector model, 155 
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Compton effect, 21 
Conjugate variables, 15 
Coordinate systems, 367 
Coulombic energy, 214 
for complex molecules, 247 
Coulombic integral, 149 
Covalence, normal, 170 
Covalent bond, 218 
Covalent double bond, 223 
Crossing of potential surfaces, 327 

Degeneracy, accidental, 186 
Degenerate eigenvalues, 96 
Degrees of freedom, definition, 9 
Depolarization of Raman lines, 279 
Deter minantal eigenfunctions, 130 
Diamagnetism, 349 
Diatomic molecules, correlation dia- 
grams, 207, 210 
electron configurations, 207 
electronic states, 190 
heteronuclear, 209 
homonuclear, 203 
nuclear spin, 265 
selection rules, 262 
symmetry of rotational states, 263 
vibrational and rotational energy, 268 
Dielectric constants, 337 
Dipole moments, 337 
matrix element, 112 
Direct product, 187 
Directed valence, 220, 227 
stable bond arrangements, 231 

Eigenfunctions, antisymmetric, 130 
determinantal, 130 
definition, 26 
linearly independent, 96 
of angular momenta operators, 42 
of commuting operators, 34 
Eigenvalues, definition, 26 
of angular momenta operators, 42 
of Hermitian operators, 27 
Einstein photoelectric hypothesis, 3 
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Einstein transition probabilities, 114 
Electromagnetic field, 108 
Electron, charge and mass of, 1 
dual nature of, 7 
magnetic moment of, 125 
Electron configurations, and directed 
valence, 231 
of complex atoms, 128 
of diatomic molecules, 207 
of the elements, 168 

Electron density distribution, by the 
Hartree method, 166 
in hydrogen, 87 
Electron-pair bond, 218 
Electron spin, 124 
angular momentum, 125 
quantum numbers, 125 
Electronic states, of complex atoms, 128 
of diatomic molecules, 190, 203 
of H 2 + ion, 201 

of heteronuclear diatomic molecules, 
209 

of homonuclear diatomic molecules, 
205 

Electronic structure of complex mole- 
cules, 232 

coulombic and exchange energies, 247 
four-electron problem, 245 
London formula, 246 
“ semi-empirical ” method, 246 
Eulerian angles, 259 
Exchange energy, 214 
in complex molecules, 247 
Exchange integral, 149 
Exclusion principle, 129 

Fermi-Dirac statistics, 285 
Fine structure, 151 
Free particle, 68 
Functions, class of, 26 

Generalized coordinates, 8, 365 
Generalized momenta, 14 
Group, Abelian, 177 
definition, 176 
element of, 177 
order of, 177 
representation of, 178 
Group theory, 172 
and quantum mechanics, 184 


Group theory, and vibrational spectra, 
274 

proof of orthogonality relations, 371 
Gyromagnetic ratio, 127 

Hamiltonian, classical, 14 
Hamiltonian operator, 29 
for charged particle in electromagnetic 
field, 108 

in generalized coordinates, 39 
Hamilton's equations, 14 
Harmonic oscillator, 75 
selection rules, 117, 123 
Hartree method, 163 
Heitler-London theory, 213 
Helium atom, perturbation method, 101 
variation method, 104 
Hermite equation, 61 
Hermite polynomials, 61 
normalization of, 63 
recursion formula for, 62 
Hund's rule, 151 
Hybridization, 220 
Hydrogen atom, 80 
continuous spectrum of, 90 
energy levels of, 82, 90 
selection rules for, 116 
Hydrogen molecule, Heitler-London 
treatment, 213 
singlet and triplet states, 216 
variation treatment, 216 
Hydrogen molectile ion, 194 
electron density distribution, 198 
electronic states, 201 
energy levels, 204 
Hydrogen quantum numbers, 85 
Hydrogen wave functions, 85 
Hydrogenlike atoms, 84 
Hydrogenlike wave functions, 89 

Indicial equation, 51 
Induced dipole moment, 118, 333 
Integrals, table of, 388 
Ionization potentials, 168 
Irreducible representations, 180 

Lagrange's equation, 9 
Lagrangian function, 11 
Laguerre equation, 66 
Laguerre polynomials, 63 
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Lande interval rule, 158 
Laplacian operator, 363 
Legendre equation, 52 
Legendre polynomials, 52 
recursion formula for, 59 
Light, dual nature of, 6 
Linear differential equations, 48 
Linear equations, simultaneous, 368 
Linearly independent eigenfunctions, 96, 
Lorentz field, 341 

Mathieu's equation, 359 
eigenvalues of, 360 
Matrices, 172 
unitary, 174 

Maxwell-Boltzmann statistics, 283 
Maxwell's equations, 343 
Molecular orbitals, 192 
quantum numbers, 203 
separated atom notation, 207 
symmetry properties, 205 
united atom notation, 203 
Molecular spectroscopy, 258 
Moments, induced, 333 
Morse curve, 272 
Multiplet structure, 135 

Neutron, 1 

Non-adiabatic reactions, 326 
Normal coordinates, 16 
of acetylene, 280 
of water, 278 
Normalized functions, 31 
Nuclear spin and diatomic molecules, 265 

Operators, angular momenta, 42 
commuting, 25 
electron spin, 126 
Hamiltonian, 29 
Hermitian, 27 
linear, 27 

magnetic moment, 127 
Optical rotatory power, 342, 346 
Orbitals, atomic, 129 
bonding, 208 
molecular, 192 

Ortho- and paradeuterium, 267 
Ortho- and parahydrogen, 266 
Orthogonal functions, 31 
Overlap integral, 197 


Paramagnetism, 348 
and electron spin, 349 
Particle in a box, 70 
Partition functions, 290, 292 
Pauli exclusion principle, 129 
Penetration of potential barriers, 302 
Periodic table, 168 
Perturbation theory, 93 
for degenerate systems, 96 
Perturbations, time dependent, 107 
Photoelectric effect, 2 
Photon, 3 

Physical constants, 361 
7T bonds, 224 
Planck's constant, 2 
Polarized light, 110 
Polarizability, 118, 121 
Potential energy surfaces, 301 
Promotion, 221 
Proton, charge and mass, 1 

Quantum mechanics, postulate^ 29 
Quantum theory, old, 5 

Radiation, induced, 110 
spontaneous, 114 
Radiation density, 113 
Raman effect, 121, 123 
Rayleigh scattering, 121 
Reaction rate theory, approximate equa- 
tion, 307 

external forces, 331 
general equation, 306 
thermodynamics, 330 
unimolecular reactions, 308 
Reduced mass, 81 
Reflection coefficient, 306 
Representation of a group, basis of, 185 
dimension of, 181 
irreducible, 180 
Resonance, 248 

Resonance energy of benzene, 249, 251, 
254 

Restricted rotator, 358 
Rigid rotator, 72, 75 
Rotational energy, diatomic molecules, 
268 

Rotatory strength, optical, 345 
Russell-Saunders coupling, 155 
Rydberg constant, 4 
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Scalar potential, 108 
Schrddinger's equations, 24, 25, 29 
Screening constants, 163 
Secular equation, 98 
Selection rules, for complex atoms, 159 
for diatomic molecules, 262 
for harmonic oscillator, 117 
for hydrogen, 116 
for vibrational spectra, 276 
Self-consistent field, 165 
Separation of wave equation, 190, 258 
a bonds, 224 

Similarity transformation, 179 
Singular points, 50 
Slater orbitals, 163 
Spherical harmonics, 47, 58 
Spin functions of hydrogen, 215 
Spin theory, complex molecules, 232 
Spinning electron, 124, 125 
Spin-orbit interaction, 151 
State function, 28 

Statistical distribution laws, 283, 285, 
287 

Statistical mechanics, 282 
and thermodynamics, 289 
Statistical weight, 290 
Stereographic projection diagrams, 379 
Sum-over-states, 2&Q 
Symmetrical top, 260 
Symmetry groups, 376 
Symmetry number, 295 
Symmetry operations, 176 

Tetrahedral orbitals, 223 
Time-dependent perturbations, 107 


Transformations of coordinates, 174 
Transition probabilities, 114 
Transmission coefficient, 304, 306 
for Eckart barrier, 311 
for special cases, 317, 320, 323 
Trial eigenfunctions, 100 

Uncertainty principle, 21 
Unpaired spins, 170 

Valence orbitals, 218 
of N, O, C, 220, 221 
Valence structure, of acetylene, 224 
of benzene, 249 
of water, 225 

Van der Waals* forces, 351 
Variation method, 99 
Vector model of atoms, 155 
Vector operators, 363 
Vector potential, 108 
Vectors, 175, 361 
Vibration theory, 16 
Vibrational spectra, and group theory, 
274 

of acetylene, 279 
of benzene, 280 
of water, 276 
selection rules, 276 
Virial theorem, 355 

Wave number, 4 

^Zeeman effect, 162 
Zero-point energy, 77 
Zeroth-order approximation, 97 










